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only TJr. if two or more Authorities are 



The demand must be filed directly with the competent International Preliminary Examining Aulhdhty TJr. ij two or more Authorities are 
competent, with the one chosen by the applicant. The full name or two-letter code of that Authority m aybe i ndicated by the applicant on the line below: 



IPEA/ 



PCT 

DEMAND 

under Article 3 1 of the Patent Cooperation Treaty: 
The undersigned requests that the international application specified below be the subject of 
international preliminary examination according to the Patent Cooperation Treaty and 
hereby elects all eligible States (except where otherwise indicated). 



T "9 J m 9001 

CHAPTER II 



Identification of IPEA 


Date of receipt of DEMAND 


Box No. I IDENTIFICATION OF THE INTERNATIONAL APPLICATION 


Applicant's or agent's file reference 
3206-172/PAR 


International application No. 
PCT/CA99/00656 


International filing date (day/month/year) 
20 July 1999 ( 20.07.99 ) 


(Earliest) Priority date (day/month/year) 
20 July 1998 ( 20.07.98 ) 



Title of invention 
SARA PROTEINS 



Box No. II APPLICANT(S) 



Name and address: (Family name followed by given name; for a legal entity, full official 
designation. The address must include postal code and name of country.) 

HSC Research and Development Limited Partnership 

555 University Avenue 

Suite 5270 

Toronto, Ontario 

M5G 1X8 

Canada 



Telephone No.: 



Facsimile No.: 



Teleprinter No.: 



State (that is, country) of nationality: 


State (thai is, country) of residence: 


CA 


CA 


Name and address: (Family name followed by given name; for a legal entity, full official designation. The address must include postal code and 


name of country.) 




WRANA, Jeffrey L. 




c/o HSC Research and Development Limited Partnership 




555 University Avenue 




Suite 5270 




Toronto, Ontario 




MSG 1X8 Canada 




State (that is, country) of nationality: 


State (that is, country) of residence: 


CA 


CA 



Name and address: (Family name followed by given name; for a legal entity, full official designation. The address must include postal code and 
name of country .) 



State (that is, country) of nationality: 


State (that is, country) of residence: 


j | Further applicants are indicated on a continuation sheet. 


Form PCT/IPEA/401 (first sheet) (July 1998; reprint January 2000) 


Legaistar 2000, Form pctdem See Notes to the demand form 



s 



Sheet No. .?. 



jternational application No. 

PCT/CA99/00656 



Box No. Ill AGENT OR COMMON REPRESENTATIVE; OR ADDRESS FOR CORRESPONDENCE 



The following person is E3 a i ent common representative 

and Kl has been appointed earlier and represents the appiicant(s) also for international preliminary examination. 

is hereby appointed and any earlier appointment of (an) agent(s) /common representative is hereby revoked. 

□ is hereby appointed, specifically for the procedure before the International Preliminary Examining Authority, in 
addition to the agent(s)/common representative appointed earlier. _ 



Name and address: (Family name followed by given name; for a legal entity, full official designation 
The address must include postal code and name of country.) 

Patricia A. Rae (Dr.) 
SIM & McBURNEY 
6th Floor 

330 University Avenue 
Toronto, Ontario 
M5G1R7 
Canada 



Telephone No.: 
416-595-1155 



Facsimile No.: 
416-595-1163 



Teleprinter No.: 



□ Address for correspondence: Mark this check-box where no agent or common representative is 
the space above is used instead to indicate a special address to which correspondence should be 



is/has been appointed and 
sent. 



Box No. IV BASIS FOR INTERNATIONAL PRELIMINARY EXAMINATION 



Statement concerning amendments:* 

1 The applicant wishes the international preliminary examination to start on the basis of: 
the international application as originally filed 
the description QG as originally filed 

[ | as amended under Article 34 

the claims Kl as o ri g inalI >' flled 

| | as amended under Article 19 (together with any accompanying statement) 

| | as amended under Article 34 

the drawings 1X1 as originally filed 

| | as amended under Article 34 
2. | 1 The applicant wishes any amendment to the claims under Article 19 to be considered as reversed. 
3 | | The applicant wishes the start of the international preliminary examination to be postponed until the expiration of 
20 months from the priority date unless the International Preliminary Examing Authority receives a copy of any 
amendments made under Article 19 or a notice from the applicant that he does not wish to make such amendments 
(Rule 69. 1(d)). (This check-box may be marked only where the time limit under Article 19 has not yet expired.) 
* Where no check-box is marked, international preliminary examination will start on the basis of the international application as 
originally filed or. where a copy of amendments* to* the claims under Article* !? and/or 'amendments of the international 
application under Article 34 are received by the International Preliminary Examining Authority before it has begun to draw up 
a written opinion or the international preliminary examination report, as so amended. 



Language for the purposes of international preliminary examination: English 

1X1 which is the language in which the international application was filed. 

1 | which is the language of a translation furnished for the purposes of international search. 

| | which is the language of publication of the international application. 

| | which is the language of the translation (to be) furnished for the purposes of international preliminary examination. 



Box No. V ELECTION OF STATES 



The applicant hereby elects all eligible States (that is, all Slates which have been designated and which are bound by Chapter J! of the 
PCT) 

excluding the following States which the applicant wishes not to elect: 



Form PCT/IPEA/401 (second sheet) (July 1 998; reprint January 2000) Le9a ' Star 2 °°°' F ° rm P ° TDEM See Notes to the demand form 









International application No. 


• 


Sheet No. 


• 


PCT/CA99/00656 



The demand is accompanied by the following elements, in the language referred to in 
Box No. IV. for the purposes of international preliminary examination: 

1. translation of international application sheets 


For International Preliminary 
Examining Authority use onl\ 

received not received 

□ □ 


2. amendments under Article 34 


sheets 


□ 


□ 


3. copy (or, where required, translation) 
of amendments under Article 19 


sheets 


□ 


□ 


4. copy (or, where required, translation) 
of statement under Article 19 


sheets 


□ 


□ 


5. letter : 


1 sheets 


□ 


□ 


6. other (specify) ' 


sheets 


□ 


□ 



Box No. VI CHECKLIST 



The demand is also accompanied by the item(s) marked below: 

1. |^| fee calculation sheet 

2. | | separate signed power of attorney 

3 I I copy of general power of attorney; 
I 1 re f erence number, if any: 



4. [ | statement explaining lack of signature 

5 I I nucleotide and or amino acid sequence listing in 
* — ' computer readable form 

other (specify): bank draft 



Box No. VII SIGNATURE OF APPLICANT, AGENT OR COMMON REPRESENTATIVE 



Next to each signature, indicate the name of the person signing and the capacity in wfych the person signs (if such capacity is not 
obvious from reading the demand). 



SIM & McBURNEY 



Patricia A. Rae (Dr.) 



1. 


Date of actual receipt of DEMAND: Jggg ^gg'tf F>£F/FTO 'f. 9 JAN 26! 


2. 


Adjusted date of receipt of demand due 
to CORRECTIONS under Rule 60.1(b): 


3. 


1 1 The date of receipt of the demand is AFTER the expiration of 19 months 1 I The applicant has been 
1 1 from the priority date and item 4 or 5, below, does not apply. 1 1 informed accordingly. 


4. 


1 1 The date of receipt of the demand is WITHIN the period of 19 months from the priority date as extended by virtue of 

1 1 Rule 80.5. 


5. 


1 1 Although the date of receipt of the demand is after the expiration of 19 months from the priority date, the delay in arrival is 

1 1 EXCUSED pursuant to Rule 82. 



For International Bureau use only 



Demand received from IPEA on: 

Form PCT/IPEA/401 (last sheet) (July 1998; reprint January 2000) Legaistar 2000. Form pctdem See Notes w tne demandform 



PCT 



CHAPTER II 



FEE CALCULATION SHEET 
Annex to the Demand for international preliminary examination 

- For International Preliminary Examining Authority use only 



International 
application No. 



PCT/CA99/00656 



Applicant's or agent's 
file reference 



3206-1 72/PAR 



Applicant 

HSC Research and Development Limited Partnership 



Calculation of prescribed fees 



1. Preliminary examination fee 

2. Handling fee (Applicants from certain States are 
entitled to a reduction of 75% of the handling fee. 
Where the applicant is (or all applicants are) so 
entitled, the amount to be entered at H is 25% of the 
handling fee .) 

3. Total of prescribed fees 

Add the amounts entered at P and H 

and enter total in the TOTAL box 



Mode of Payment 






j 1 authorization to charge deposit 

1 | account with the IPEA (see below) 


□ 


cash 


| | cheque 


□ 


revenue stamps 


| | postal money order 


□ 


coupons 


|Xl bank draft 


□ 


other (specify): 




Deposit Account Authorization (this mode of payment may not be available at all IPEAs) 



The IPEA/ 




is hereby authorized to charge the total fees indicated above to my deposit account. 



□ (this check-box may be marked only if the conditions for deposit accounts of the IPEA so permit) is 
hereby authorized to charge any deficiency or credit any overpayment in the total fees indicated 
above to my deposit account. 



Deposit Account Number Date (day/month/year) Signature 

Form PCT/IPEA/40 1 (Annex) (July 1 998; reprint January 2000) Legaistar 2000. Form pctdfee See Notes to the fee calculation sheet 



PATENT COOPERATION TREATY 



From the „, xw 

INTERNATIONAL PRELIMINARY EXAMINING AUTHORITY 



To: 

RAE, Patricia A. 
Sim & McBumey 
330 University Avenue 
6th floor 

Toronto, Ontario M5G1R7 
CANADA 



RECEIVE! 

SEP 22 2000 



SIM & NPBURNEY 
SIM, HUGHES, ASHTON & McKAY 



^9/744167 

PCT 



NOTIFICATION OF TRANSMITTAL OF 
THE INTERNATIONAL PRELIMINARY 
EXAMINATION REPORT 

(PCT Rule 71.1) 



Applicant's or agenf s file reference 
3206-172/PAR 



Date of mailing 
(day/month/year) 



19.09.2000 



IMPORTANT NOTIFICATION 



International application No. 
PCT/CA99/00656 



International filing date (day/month/year) 
20/07/1999 



Priority date (day/month/year) 
20/07/1998 



Applicant 

HSC RESEARCH AND DEVELOPMENT LIMITED ... et al. 



1 The applicant is hereby notified that this International Preliminary Examining Authority transmits herewith the 
" international preliminary examination report and its annexes, if any, established on the international application. 

2. A copy of the report and its annexes, if any, is being transmitted to the International Bureau for communication 
to all the elected Offices. 

3 Where required by any of the elected Offices, the International Bureau will prepare an English translation of the 
report (but not of any annexes) and will transmit such translation to those Offices. 

4. REMINDER 

The applicant must enter the national phase before each elected Office by performing certain acts (filing 
translations and paying national fees) within 30 months from the priority date (or later in some Offices) (Article 
39(1)) (see also the reminder sent by the International Bureau with Form PCT/IB/301). 

Where a translation of the international application must be furnished to an elected Office, that translation must 
contain a translation of any annexes to the international preliminary examination report. It is the applicant s 
responsibility to prepare and furnish such translation directly to each elected Office concerned. 

For further details on the applicable time limits and requirements of the elected Offices, see Volume II of the 
PCT Applicant's Guide. 



Name and mailing address of the IPEA/ 



European Patent Office 
D-80298 Munich 

Tel. +49 89 2399 - 0 Tx: 523656 epmu d 
Fax: +49 89 2399 - 4465 



Authorized officer 
Vullo, C 

Tel. +49 89 2399-8061 




Form PCT/IPEA/416 (July 1992) 



gATENT COOPERATION TR^g /7441 Q 7 

PCT 

INTERNATIONAL PRELIMINARY EXAMINATION REPORT 

(PCT Article 36 and Rule 70) 



Applicant's or agent's file reference 
3206-1/2/PAR 



International application No. 
PCT/CA99/00656 



See Notification of Transmittal of International 
FOR FURTHER ACTION Preliminary Examination Report (Form PCT/IPEA/416) 



International filing date (day/month/year) 
20/07/1999 



Priority date (day/month/year) 
20/07/1998 



International Patent Classification (IPC) or national classification and IPC 
C12N15/12 



Applicant 

HSC RESEARCH AND DEVELOPMENT LIMITED ... et al. 



1 . This international preliminary examination report has been prepared by this International Preliminary Examining Authority 
and is transmitted to the applicant according to Article 36. 

2. This REPORT consists of a total of 8 sheets, including this cover sheet. 

□ This report is also accompanied by ANNEXES, i.e. sheets of the description, claims and/or drawings which .have 
JeenTended and are the basis for this report and/or sheets containing rectifications made before this Authority 
(see Rule 70.16 and Section 607 of the Administrative Instructions under the PCT). 

These annexes consist of a total of sheets. 



3. This report contains indications relating to the following items: 



I 




II 


□ 


III 


□ 


IV 


IS 


V 


B 


VI 


□ 


VII 


□ 


VIII 





Lack of unity of invention 

Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability; 
citations and explanations suporting such statement 



Certain observations on the international application 



Date of submission of the demand 
17/02/2000 



Date of completion of this report 
19.09.2000 



Name and mailing address of the international 
preliminary examining authority: 

European Patent Office 

D-80298 Munich 

Tel. +49 89 2399 - 0 Tx: 523656 epmu d 
Fax: +49 89 2399 - 4465 



Authorized officer 
Lanzrein, M 

Telephone No. +49 89 2399 7358 




Form PCT/I PEA/409 (cover sheet) (January 1994) 



INTERNATIONAL PRELI 
EXAMINATION REPORT 



ARY 



International application No. PCT/CA99/00656 



I. Basis of the report 

1 This report has been drawn on the basis of (substitute sheets which have been furnished to the receiving Office in 
rZonse to " inZtion under Article 14 are referred to in this report as "originally filed" and are not annexed to 

the report since they do not contain amendments.) : 

Description, pages: 

-I _62 as originally filed 



Claims, No.: 

1-44 

Drawings, sheets: 

1/20-20/20 



as originally f iled 



as originally filed 



2. The amendments have resulted in the cancellation of: 

□ the description, pages: 

□ the claims, Nos.: 

□ the drawings, sheets: 



3. 



□ This report has been established as if (some of) the amendments had not been made, since they have been 
considered to go beyond the disclosure as filed (Rule 70.2(c)): 



4. Additional observations, if necessary: 



IV. Lack of unity of invention 

1 . In response to the invitation to restrict or pay additional fees the applicant 

□ restricted the claims. 

□ paid additional fees. 

□ paid additional fees under protest. 

□ neither restricted nor paid additional fees. 



Form PCT/IPEA/409 (Boxes l-VIII, Sheet 1) (January 1994) 



INTERNATIONAL PRELIMINARY 
EXAMINATION REPORT 



International application No. PCT/CA99/00656 



2 H This Authority found that the requirement of unity of invention is not complied and chose, according to Rule 
68.1 , not to invite the applicant to restrict or pay additional fees. 

3. This Authority considers that the requirement of unity of invention in accordance with Rules 13.1, 13.2 and 13.3 is 

□ complied with. 

B not complied with for the following reasons: 
see separate sheet 

4. Consequently, the following parts of the international application were the subject of international preliminary 
examination in establishing this report: 

all parts. 

□ the parts relating to claims Nos. . 



V Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial 
applicability; citations and explanations supporting such statement 



1. Statement 
Novelty (N) 

Inventive step (IS) 



Yes: Claims 3, 5, 9, 10, 21 , 23, 24, 37-44 

No: Claims 1,2,4, 6-8, 1 1 -20, 22, 25-36 

Yes: Claims 

No: Claims 1-44 



Industrial applicability (IA) Yes: Claims 1-44 

No: Claims 



2. Citations and explanations 
see separate sheet 



VIIL Certain observations on the international application 

The following observations on the clarity of the claims, description, and drawings or on the question whether the 
claims are fully supported by the description, are made: 

see separate sheet 



Form PCT/IPEA/409 (Boxes I- VIII, Sheet 2) (January 1994) 



INTERNATIONAL PRELIMINARY International application No. PCT/CA99/00656 
EXAMINATION REPORT - SEPARATE SHEET 



Re Item IV 

Lack of unity of invention 

Reference is made to the following documents: 

D1: B. MECKELEIN ET AL: 'Identification of a novel serine protease-like molecule in 
human brain' MOL. BRAIN RESEARCH, vol. 55, no. 2, April 1998 (1998-04), 
pages 181-197. AMSTERDAM, NL & EMBL SEQUENCE DATABASE, 24 May 
1999 (1999-05-24), Cambridge, UK 

D2: NAGASE T ET AL: 'Prediction of the coding sequences of unidentified human 
genes. VII. The complete sequences of 100 new cDNA clones from brain which 
can code for large proteins in vitro' DNA RESEARCH, JP, UNIVERSAL ACADEMY 
PRESS, vol. 4, no. 4, page 141-150. ISSN: 1340-2838 -& EMBL SEQUENCE 
DATABASE, 1 July 1997 (1997-07-01) 



The application lacks unity within the meaning of Art. 34(3) and Rule 13.1 PCT. The 
separate inventions are: 

1) human SARA1: claim 7, (completely); claims 1, 2, 4, 6, 11-19, 20, 22, 23, 25-44 
(partially) 

2) human SARA2: claim 8, (completely); claims 1 , 2, 4, 6, 1 1-19, 20, 22, 23, 25-44 
(partially) 

3) Xenopus SARA1 : claim 9 (completely); claims 1 , 3 , 5, 6, 1 1-19, 21 , 24-44 
(partially) 

4) Xenopus SARA2: claim 10 (completely); claims 1, 3, 5, 6, 11-19, 21, 24-44 
(partially) 

They are not so linked as to form a single general inventive concept (Rule 13.1 PCT) 
for the following reasons: The four inventions concern four related proteins, two from 
human and two from Xenopus. Since the two human proteins are already known from 
D1 and D2 (see also reasoned statement below under item V), there is no common 
inventive concept which is new over the prior art. 

All four inventions have been examined, the lack of unity nonwithstanding. However, a 



Form PCT/Separate Sheet/409 (Sheet 1) (EPO-April 1997) 



INTERNATIONAL PRELIMINARY International application No. PCT/CA99/00656 
EXAMINATION REPORT - SEPARATE SHEET . . 

unity objection would most likely ensue in case of later entry into regional phases of the 
present application. 



Re Item V 

Reasoned statement under Article 35(2) with regard to novelty, inventive step or 
industrial applicability; citations and explanations supporting such statement 



1 



The present application concerns Smad Anchor for Receptor Activation (SARA) 
proteins. The applicants provide the sequences of 2 human SARA proteins 
(hSARAI , hSARA2) and two from Xenopus (XSaral, XSara2). SARA proteins 
bind Smad signaling molecules and thereby recruit them into distinct subcellular 
domains. TGFp signaling induces dissociation of Smad2 from SARA and 
formation of Smad2/Smad4 complex. This complex translocates to the nucleus to 
regulate gene expression. 

Claimed are the polynucleotide and protein sequences, antibodies, transgenic 
animals and various methods of treatment and of screening for allelic variants or 
candidate modulator compounds. 



2. Claims 1 , 2, 4, 6-8, 1 1 -20, 22, 25-36 lack novelty according to Art. 33 (2) PCT. 

2.1 Claims 1, 2, 4, 6, 7, 11-20, 22, 25-36 are not new in view of D1. This document 
discloses cloning and expression of a novel serine protease (NSP). The cDNA 
sequence of NSP is identical to the hSARAI sequence in Table 1 and, in 
consequence, the protein sequences are identical as well. The cDNA of NSP was 
recombinantly expressed in E. coli cells and purified via addition of a six histidine 
tag (p. 193, "3.5 Expression of recombinant..."). The purified protein was used to 
generate monoclonal antibodies (Fig. 6). 

The disclosures in D1 are thus prejudicial to novelty of said claims. 
We would like to emphasize that the functional features and functional domains 
which are included in some of the claims are in this case not relevant for 
assessment of novelty, because said functions are inherently present in the prior 
art molecule of D1. All claims concerned here are directed to a product which 
already existed. The existing product inherently possessed all the features even if 



Form PCT/Separate Sheet/409 (Sheet 2) (EPO- April 1997) 



INTERNATIONAL PRELIMINARY International application No. PCT/CA99/00656 
EXAMINATION REPORT - SEPARATE SHEET . , 

not disclosed explicitly. 

In general, a claim to a substance or composition for a particular use should be 
construed as meaning a substance or composition which is in fact suitable for the 
stated use; a known product which is in fact suitable for the stated use, though it 
has never been described for that use is in fact prejudicial to novelty. 

2.2 Claims 1 , 2, 4, 6, 8, 1 1 -1 6, 25-27 are not new in view of D2, cited by the 
applicant. D2 reports the isolation of 100 large cDNA inserts from a human brain 
library. The clone KIAA 305 (see Table 1, p. 145) is identical in sequence with the 
sequences of human SARA2 in Tables 3 and 4. This clone was 
transcribed/translated in vitro to confirm the size of the predicted ORF (Table 1). 
The above mentioned claims concern only the protein and DNA sequences and 
D2 is therefore prejudicial to novelty. 

2.3 Claims 3, 5, 9, 10, 21, 23, 24, 37-44 are considered novel over the prior art in the 
sense of Art. 33 (2) PCT. 



3. Claims 1 , 3, 5, 6, 9-19, 21 , 23-44 lack inventive step according to Art. 33 (3) PCT 

3.1 Claims 1 , 3 , 5, 6, 9-19, 21 , 24-44 concern the SARA proteins from Xenopus. 

For evaluation of inventiveness of the claims concerning the XSARA1 protein the 
closest prior art document is D1 , which discloses the human sequence of SARA1 . 
Correspondingly, for evaluation of inventiveness of the claims concerning 
XSARA2 the closest prior art is D2. 

The problem underlying the subject matter of the above claims could be seen in 
the provision of a further sequence of a SARA protein, which was solved by the 
provision of the Xenopus sequences. A person skilled in the art does not require 
any inventive skill for the cloning of the corresponding SARA sequences from 
Xenopus, especially when taking into account that it is general common 
knowledge that the corresponding human genes have high degrees of sequence 
similarity. Cloning of the Xenopus sequences with probes derived from the highly 
related human sequences is achieved by the mere use of standard techniques 
without undue difficulties and a high expectation of success. 
Moreover, it was known that the SARA proteins are expressed fairly ubiquitously 



Form PCT/Separate Sheet/409 (Sheet 3) (EPO-April 1997) 



INTERNATIONAL PRELIMINARY International application No. 

EXAMINATION REPORT - SEPARATE SHEET 



PCT/CA99/00656 



e the activity was present in a variety of tissues. It was shown in D1 that NSP was 
expressed in all tissues investigated (Table 1). Also KIAA 305 was shown to be 
expressed in 10 out of 14 tissues investigated. This ubiquitous distribution points 
to an important role for those proteins in essential cellular processes, suggesting 
high conservation during evolution. It is therefore likely that said proteins are 
present not only in mammals but also in other more distant classes of organisms. 
Thus, indeed there was good reason to expect the presence of SARA 
homologues in Xenopus. 

3.2 Claim 23 concerns the purified form of human SARA2 protein. The skilled person 
would, without exercise of inventive skill, purify said protein starting out from the 
cDNA sequence disclosed in D2. There are many standard techniques available 
to the person skilled in the art to perform such purification. 

3.3 Claims 37-44 concern a catalogue of standard applications, which cannot be 
considered inventive since the SARA proteins representing the core of said 
applications are not novel and not inventive. 



Re Item VIII 

Certain observations on the international application 

1. Claims 1-5, 12, 19-22, 29, 31, 33, 35-44 refer to a SARA protein. The protein or 
DNA is specified solely by a name which, however, is meaningless to the person 
skilled in the art. Although the protein was known in the prior art, it was given 
other names therein. It is therefore not clear what the subject matter is. 
In general, the technical feature characterizing a protein or DNA being a chemicj 
product is the sequence. 



2. Claims 12, 18, 19, 28-31 , 33, 35 do not meet the requirements of Article 6 
with regard to clarity. 

Said claims refer to "functional domain", "fragment", or "substantially pure 
polypeptide". These terms are vague and unclear and leave the reader in 



Form PCT/Separate Sheet/409 (Sheet 4) (EPO-April 1997) 



INTERNATIONAL PRELIMINARY International application No. 
EXAMINATION REPORT - SEPARATE SHEET 



PCT/CA99/00656 



as to the meaning of the technical features to which they refer, thereby rendering 
the definition of the subject-matter of said claims unclear. 

3. Claim 40 refers to methods for treatment of the human or animal body. In case of 
an European application this would not be allowable because said methods are 
not regarded as inventions which are susceptible of industrial application. 
However, no unified criteria exist in the PCT Contracting States. The patentability 
can also be dependent upon the formulation of the claims. 

4. In claim 43, claim L01 should probably read claim 42. 
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PCT REQUEST 



3206-1 72/PAR 



Original (for SUBMISSION) • printed on 19.07.1999 03:09:29 PM 



0 

0-1 


For receiving Office use only 

International Application No. 




0-2 


international Filing Date 




0-3 


Name of receiving Office and "PCT 
International Application" 





Form - PCT/RO/101 PCT Request 
Prepared using 



PCT-EASY Version 2.83 
(updated 01.03.1999) 



Petition 

The undersigned requests that the 
present international application be 
processed according to the Patent 
Cooperation Treaty 



Receiving Office (specified by the 
applicant) 



Canadian Patent Office (RO/CA) 



Applicant's or agent's file reference 



3206-172/PAR 



Title of invention 



SARA PROTEINS 



Applicant 

This person is: 

Applicant for 
Name 

Address: 



State of nationality 
State of residence 
Telephone No. 
Facsimile No. 



applicant only- 
all designated States except US 
HSC RESEARCH AND DEVELOPMENT LIMITED 
PARTNERSHIP 
555 University Avenue 
Suite 5270 

Toronto, Ontario MSG 1X8 

Canada 

CA 

CA 

416-813-5724 
416-813-5085 



Applicant and/or inventor 

This person is: 

Applicant for 
Name (LAST, First) 
Address: 



State of nationality 
State of residence 



applicant and inventor 
US only 

WRANA, Jeffrey, L. 

c/o HSC Research and Development Limited 

Partnership 

555 University Avenue 

Suite 5270 

Toronto, Ontario MSG 1X8 

Canada 

CA 

CA 




PCT REQUEST 3206-1 72/par 

Original (for SUBMISSION) - printed on 19.07.1999 03:09:29 PM 



IV-1 


Agent or common representative; or 
address for correspondence 






The person identified below is hereby/has 


agent 




been appointed to act on behalf of the 




appiicant(s) before the competent 






International Authorities as: 




IV-1 -1 


Name (LAST, First) 


RAE , Patricia, A. 


IV-1 -2 


Address: 


Sim & McBurney 

330 University Avenue 

6th Floor 

Toronto, Ontario MSG 1R7 
Canada 


IV-1 -3 


Telephone No. 


416-595-1155 


IV-1 -4 


Facsimile No. 


416-595-1163 


V 


Designation of States 




V-1 


Regional Patent 


AP: GH GM KE LS MW SD SZ UG ZW and any 
other State which is a Contracting State 




(other kinds of protection or treatment, if 
any, are specified between parentheses 




after the designation(s) concerned) 


of the Harare Protocol and of the PCT 
EA: AM AZ BY KG KZ MD RU TJ TM and any 
other State which is a Contracting State 
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Field of the Invention 

The invention relates to a family of proteins, the SARA proteins, which 
bind to receptor-regulated Smad proteins and are involved in appropriate 
localization of these Smad proteins for receptor activation. 

Background of the Invention 

The Transforming Growth Factor-beta (TGFp) superfamily, whose 
members include TGFps, activins and bone morphogenetic proteins (BMPs), 
have wide ranging effects on cells of diverse origins (Attisano and Wrana, 1998; 
Heldin et al., 1997; Kretzschmar and Massague, 1998). Signaling by these 
secreted factors is initiated upon interaction with a family of cell-surface 
transmembrane serine/threonine kinases, known as type I and type II receptors. 
Ligand induces formation of a typel/typell heteromeric complex which permits 
the constitutively active type II receptor to phosphorylate, and thereby activate, 
the type I receptor (Wrana et aL, 1994). This activated type I receptor then 
propagates the signal to a family of intracellular signaling mediators known as 
Smads (Attisano and Wrana, 1998; Heldin et al., 1997; Kretzschmar and 
Massague, 1998). 

The first members of the Smad family identified in invertebrates were the 
Drosophila MAD and the C. elegans sma genes (sma-2, sma-3 and sma-4; Savage 
et al., 1996; Sekelsky et al., 1995). Currently, the family includes additional 
invertebrate Smads, as well as nine vertebrate members, Smadl through 9 
(Attisano and Wrana, 1998; Heldin et al., 1997; Kretzschmar and Massague, 
1998). Smad proteins contain two conserved amino (MH1) and carboxy (MH2) 
terminal regions separated by a more divergent linker region. In general, Smad 
proteins can be subdivided into three groups; the receptor-regulated Smads, 
which include Smad 1, 2, 3, 5 and 8, Mad, sma-2 and sma-3; the common 
Smads, Smad4 and Medea, and the antagonistic Smads, which include Smad6, 7 
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and 9, DAD and daf-3 (Heldin et al., 1997; Nakayama et al., 1998; Patterson et 
al., 1997). 

Numerous studies with vertebrate Smad proteins have provided insights 
into the differential functions of these proteins in mediating signaling. Receptor- 
5 regulated Smads are direct substrates of specific type I receptors and the proteins 
are phosphorylated on the last two serines at the carboxy-terminus within a 
highly conserved SSXS motif (Abdollah et al., 1997; Kretzschmar et al., 1997; 
Liu et al., 1997b; Macias-Silva et al., 1996; Souchelnytskyi et al., 1997). 
Interestingly, Smad2 and Smad3 are substrates of TGFp or activin receptors and 

1 0 mediate signaling by these ligands (Liu et al., 1 997b; Macias-Silva et al., 1 996; 
Nakao et al., 1997a), whereas Smadl, 5 and 8 appear to be targets of BMP 
receptors and thereby propagate BMP signals (Chen et al., 1997b; Hoodless et 
al., 1995; Kretzschmar et al., 1997; Nishimura et al., 1998). Once 
phosphorylated, these Smads bind to the common Smad, Smad4, which lacks 

1 5 the carboxy-terminal phosphorylation site and is not a target for receptor 
phosphorylation (Lagna et al., 1996; Zhang et al., 1997). Heteromeric 
complexes of the receptor-regulated Smad and Smad4 translocate to the nucleus 
where they function to regulate the transcriptional activation of specific target 
genes. The antagonist Smads, Smad6, 7 and 9 appear to function by blocking 

20 ligand-dependent signaling by preventing access of receptor-regulated Smads to 
the type I receptor or possibly by blocking formation of heteromeric complexes 
with Smad4 (reviewed in Heldin et al., 1997). 

Analysis of the nuclear function of Smads has demonstrated that Smads 
can act as transcriptional activators and that some Smads, including Drosophila 

25 Mad, and the vertebrate Smad3 and Smad4, can bind directly to DNA, albeit at 
relatively low specificity and affinity (Dennler et al., 1998; Kim et al., 1997; 
Labbe et al., 1 998; Yingling et al., 1 997; Zawel et al., 1 998). 

Localization of Smads is critical in controlling their activity and Smad 
phosphorylation by the type I receptor regulates Smad activity by inducing 

30 nuclear accumulation (Attisano and Wrana, 1 998; Heldin et al., 1 997; 

Kretzschmar and Massague, 1 998). However, little is known about how Smad 
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localization is controlled prior to phosphorylation and how this might function 
in modulating receptor interactions with its Smad substrates. 

Summary of the Invention 
5 Smad proteins (Smads) transmit signals from transmembrane ser/thr kinase 

receptors to the nucleus. Mammalian and non-mammalian proteins have been 
identified which interact directly with Smads and are designated the Smad 
Anchor for Receptor Activation or SARA proteins. 

The invention provides cDNA sequences encoding this previously 

10 undescribed family of SARA proteins which bind to receptor-regulated Smad 
proteins and ensure appropriate localization of these Smad proteins for 
activation by a Type I receptor of a TGFp, activin or BMP signaling pathway. 

For example, TGFp signaling induces dissociation of Smad2 or Smad3 
from a SARA protein with concomitant formation of Smad2/Smad4 or 

1 5 Smad3/Smad4 complexes and nuclear translocation. In the absence of signaling, 
SARA functions to recruit a particular Smad (eg. Smad2 or Smad3) to distinct 
subcellular sites in the cell and interacts with the TGFp superfamily receptor 
complex in cooperation with the particular receptor regulated Smad. Mutations 
in hSARAI that cause mislocalization of Smad2, and interfere with receptor 

20 association, inhibit receptor-dependent transcriptional responses, indicating that 
regulation of Smad localization is essential for TGFp superfamily signaling. The 
invention provides a novel component of the signal transduction pathway that 
functions to anchor Smads to specific subcellular sites for activation by the Type 
I receptor of the TGFp, activin or BMP signaling pathways. 

25 The SARA proteins are characterised by the presence of three domains, a 

double zinc finger or FYVE domain responsible for the subcellular localisation of 
the SARA protein or SARA-Smad complex, a Smad-binding domain which 
mediates the interaction or binding of one or more species of Smad protein and 
a carboxy terminal domain which mediates association with the TGFp 

30 superfamily receptor. The FYVE domain may bind phosphatidyl inositol-3- 
phosphate. 
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In accordance with one embodiment, the invention provides isolated 
polynucleotides comprising nucleotide sequences encoding SARA proteins. 

In accordance with a further series of embodiments, the invention 
provides an isolated polynucleotide selected from the group consisting of 
5 (a) a nucleotide sequence encoding a human SARA protein; 



10 sequence of Table 2 (hSARAl : Sequence ID NO:2); 

(e) a nucleotide sequence encoding the human SARA amino acid 
sequence of Table 4 (hSARA2: Sequence ID NO:4); 

(f) a nucleotide sequence encoding the Xenopus SARA amino acid 
sequence of Table 6 (XSARA1: Sequence ID NO:6); 

15 (g) a nucleotide sequence encoding the Xenopus SARA amino acid 

sequence of Table 8 (XSARA2: Sequence ID NO:8). 

In accordance with a further embodiment, the invention provides the 
nucleotide sequences of Table 1 (human SARA! or hSARAP . Table 3 (human 
SARA2 or hSARA2), Table 5 (Xenopus SARA1 or XSARA1) and Table 7 (Xenopus 
20 SARA2 or XSARA2). 

In accordance with a further embodiment, the invention provides 
recombinant vectors including the polynucleotides disclosed herein and host 
cells transformed with these vectors. 



25 comprising culturing such host cells to permit expression of a SARA protein- 
encoding polynucleotide and production of the protein. 

The invention also includes polynucleotides which are complementary to 
the disclosed nucleotide sequences, polynucleotides which hybridize to these 
sequences under high stringency and degeneracy equivalents of these 

30 sequences. 



(b) 



a nucleotide sequence encoding a mammalian SARA protein; 
a nucleotide sequence encoding a non-mammalian SARA 
protein; 

a nucleotide sequence encoding the human SARA amino acid 



(c) 



(d) 



The invention further provides a method for producing SARA proteins, 
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In accordance with a further embodiment, the invention provides 
antisense molecules which may be used to prevent expression of a SARA 
protein. Such antisense molecules can be synthesised by methods known to 
those skilled in the art and include phosphorothioates and similar compounds. 
5 The invention further includes polymorphisms and alternatively spliced 

versions of the disclosed SARA genes and proteins wherein nucleotide or amino 
acid substitutions or deletions do not substantially affect the functioning of the 
gene or its encoded protein. 

The invention also enables the identification and isolation of allelic 
1 0 variants or homologues of the described SARA genes, and their corresponding 
proteins, using standard hybridisation screening or PCR techniques. 

The invention provides a method for identifying allelic variants or 
homologues of the described SARA genes, comprising 

choosing a nucleic acid probe or primer capable of hybridizing to a SARA 
1 5 gene sequence under stringent hybridisation conditions; 

mixing the probe or primer with a sample of nucleic acids which may 
contain a nucleic acid corresponding to the variant or homologue; and 

detecting hybridisation of the probe or primer to the nucleic acid 
corresponding to the variant or homologue. 
20 In accordance with a further embodiment, the invention provides 

fragments of the disclosed polynucleotides, such as polynucleotides of at least 
10, preferably 15, more preferably 20 consecutive nucleotides of the disclosed 
polynucleotide sequences. These fragments are useful as probes and PCR 
primers or for encoding fragments, functional domains or antigenic determinants 
25 of SARA proteins. 

In accordance with a further embodiment, the invention provides 
substantially purified SARA proteins, including the proteins of Table 2 (hSARAI), 
Table 4 (hSARA2), Table 6 (XSARA1) and Table 8 (XSARA2). 

In accordance with one embodiment, a SARA protein has a FYVE domain, 
30 a Smad binding domain (SBD) and an amino acid sequence having at least 50% 
overall identity with the amino acid sequence of hSARA! (Sequence ID NO:2). 
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In accordance with a preferred embodiment, a SARA protein has a FYVE 
domain having at least 65% identity of amino acid sequence with the FYVE 
domain of hSARAI and a C-terminal sequence of 550 consecutive amino acids 
which have at least 50% identity with the C-terminal 550 amino acid residues of 
5 hSARAI. 

In accordance with a more preferred embodiment, a SARA protein has a 
FYVE domain having at least 65% identity of amino acid sequence with the 
FYVE domain of hSARAI and wherein the portion of the SBD corresponding to 
amino acid residues 721 to 740 of hSARAI has at least 80% identity with that 
1 0 portion of hSARAI . 

The invention further provides a method for producing antibodies which 
selectively bind to a SARA protein comprising the steps of 

administering an immunogenically effective amount of a SARA 
immunogen to an animal; 
1 5 allowing the animal to produce antibodies to the immunogen; and 

obtaining the antibodies from the animal or from a cell culture derived 
therefrom. 

The invention further provides substantially pure antibodies which bind 
selectively to an antigenic determinant of a SARA protein. The antibodies of the 
20 invention include polyclonal antibodies, monoclonal antibodies and single chain 
antibodies. 

The invention includes analogues of the disclosed protein sequences, 
having conservative amino acid substitutions therein. The invention also 
includes fragments of the disclosed protein sequences, such as peptides of at 
25 least 6, preferably 10, more preferably 20 consecutive amino acids of the 
disclosed protein sequences. 

The invention further provides polypeptides comprising at least one 
functional domain or at least an antigenic determinant of a SARA protein. 
In accordance with a further embodiment, the invention provides 
30 peptides which comprise SARA protein Smad binding domains and 
polynucleotides which encode such peptides. 
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In accordance with a further embodiment, the invention provides a Smad 
binding domain peptide selected from the group consisting of 

(a) SASSQSPNPNNPAEYCSTIPPLQQAQASGALSSPPPTVMVPVGV 

LKHPGAEVAQPREQRRVWFADGILPNGEVADAAKLTMNGTSS; and 
5 (b) amino acids 589 to 672 of the XSARA1 sequence of Table 9. 

The invention includes fragments and variants of these Smad binding 
domain peptides which retain the ability to bind a Smad protein. 

In accordance with a further embodiment, the invention provides 
peptides which comprise SARA protein FYVE domains and polynucleotides 
10 which encode such peptides. 

In accordance with a further embodiment, the invention provides a FYVE 
domain peptide selected from the group consisting of 

(a) amino acids 587 to 655 of the hSARAI sequence of Table 9; 

(b) amino acids 510 to 578 of the XSARA1 sequence of Table 9; and 
1 5 (c) the consensus amino acid sequence of Table 1 0. 

The invention includes fragments and variants of these FYVE domain 
peptides which retain the function of the parent peptide. 

In accordance with a further embodiment, the invention provides 
peptides which comprise SARA protein TGFp receptor interacting domains and 
20 polynucleotides which encode such peptides. 

In accordance with a further embodiment, the invention provides a TGFp 
receptor interacting domain peptide comprising amino acids 751 to 1 323 of the 
hSARAI sequence of Table 9. 

The invention includes fragments and variants of these TGFp receptor 
25 binding domain peptides which retain the binding ability of the parent peptide. 

The invention further provides methods for modulating signaling by 
members of the TGFp superfamily which signal through pathways which involve 
a SARA protein. 

Modulation of signaling by a TGFP superfamily member through such a 
30 pathway may be effected, for example, by increasing or reducing the binding of 
the SARA protein involved in the pathway with its binding partner. 
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In accordance with a further embodiment, TGFp superfamily signaling, 
including TGFp signaling, by a pathway involving a SARA protein described 
herein may be modulated by modulating the binding of the SARA protein to a 
Smad binding partner, by modulating the binding of its FYVE domain to its 
5 binding partner or by modulating the binding of the SARA protein to a TGFp 
superfamily receptor, such as the TGFp receptor. 

For example, the binding of a SARA protein to a Smad binding partner 
may be inhibited by a deletion mutant of the protein lacking either the SBD 
domain or the FYVE domain or by the SARA protein Smad binding domain 

1 0 peptides or FYVE domain peptides described herein, and effective fragments or 
variants thereof. The binding of a SARA protein to a TGFp superfamily receptor 
may be inhibited by a deletion mutant of the protein lacking a C terminal portion 
or by the SARA protein TGFp receptor binding domain peptides described 
herein, and effective fragments and variants thereof. 

15 In accordance with a further embodiment, TGFp superfamily signaling, 

including TGFp signaling, by a pathway involving a SARA protein may be 
modulated by modulating the binding of the SARA protein FYVE domain to 
phosphatidyl inositol-3-phosphate, by increasing or decreasing the availability of 
phosphatidyl inositol-3-phosphate or by administration of agonists or antagonists 

20 of phosphatidyl inositol-3-phosphate kinase. 

The invention also provides a method of modulating a TGFp superfamily 
signaling pathway involving phosphatidyl inositol-3-phosphate, including a 
TGFp signaling pathway, by increasing or decreasing the availability of SARA 
protein or by modulating the function of SARA protein. 

25 The invention further provides methods for preventing or treating diseases 

characterised by an abnormality in a TGFp superfamily member signaling 
pathway which involves a SARA protein, by modulating signaling in the 
pathway, as described above. 



30 associated with scarring, with arthritis and with fibrosis in numerous diseases, 
including fibrosis of the liver and kidney. TGFp signaling is also involved in 



TGFp signaling is important in wound healing, and excessive signaling is 
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modulating inflammatory and immune responses and can contribute to tumour 
progression. 

The invention thus provides methods for modulating TGFp-dependent 
cell proliferation or fibrogenesis. 



in protecting tissues and restoring or regenerating tissues after tissue damage, for 
example in bone, kidney, liver and neuronal tissue (see, for example, (Reddy, 
A.H. (1998), Nature Biotechnology, v. 16, pp. 247-252). 

The invention further provides methods for modulating BMP-dependent 
10 phenotypic marker expression by modulating the interactions of SARA proteins 
involved in these BMP signaling pathways. 

In accordance with a further embodiment, modified versions of a SARA 
protein may be provided as dominant-negatives that block TGFp superfamily 
signaling. These modified versions of SARA could, for example, lack the Smad 
1 5 binding domain and thereby prevent recruitment of Smad or could lack the FYVE 
domain and thereby inhibit signaling by interfering with translocation. 

These modified versions of SARA may be provided by gene therapy, for 
example using transducing viral vectors. Expression may be driven by inclusion 
in the vector of a promoter specific for a selected target cell type. Many 
20 examples of such specific promoters are known to those skilled in the art. 

In a further embodiment, a normal version of a SARA protein such as 
hSARAI could be provided by gene therapy to restore function in a disease 
wherein SARA is mutated or non-functional. 



25 composition comprising a purified SARA protein as active ingredient. 

In accordance with a further embodiment, the invention provides non- 
human transgenic animals and methods for the production of non-human 
transgenic animals which afford models for further study of the SARA system and 
tools for screening of candidate compounds as therapeutics. For example, knock 

30 out animals, such as mice, may be produced with deletion of a SARA gene. 



5 



The BMP signaling pathways are important in tissue morphogenesis and 



In a further embodiment, the invention provides a pharmaceutical 
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These animals may be examined for phenotypic changes and used to screen 
candidate compounds for effectiveness to reverse these changes. 

In a further example, transgenic animals may be produced expressing a 
dominant negative mutant of a SARA protein, as described above, either 
generally or in specific targeted tissues. 

The invention provides many targets for the development of small 
molecule drugs, including peptides and peptidomimetic drugs, to interfere with 
the interaction of the various binding partners described herein and thereby 
modulate signaling by members of the TGFp superfamily, including TGFp and 
BMPs. 

The invention further provides methods for screening candidate 
compounds to identify those able to modulate signaling by a member of the 
TGFp superfamily through a pathway involving a SARA protein. 

For example, the invention provides screening methods for compounds 
able to bind to a SARA protein which are therefore candidates for modifying the 
activity of the SARA protein. Various suitable screening methods are known to 
those in the art, including immobilization of a SARA protein on a substrate and 
exposure of the bound SARA protein to candidate compounds, followed by 
elution of compounds which have bound to the SARA protein. The methods 
used to characterise the binding interactions of the SARA proteins disclosed 
herein, as fully described in the examples herein, may also be used to screen for 
compounds which are agonists or antagonists of the binding of a SARA protein. 

This invention also provides methods of screening for compounds which 
modulate TGFp superfamily signaling by detecting an alteration in the 
phosphorylation state of a SARA protein. 

In accordance with a further embodiment, the invention provides a 
method for reducing or preventing TGFp, activin or BMP signaling by inhibiting 
the activity of SARA. SARA activity may be inhibited by use of an antisense 
sequence to the SARA gene or by mutation of the SARA gene. 
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Summary of the Drawings 

Certain embodiments of the invention are described, reference being 
made to the accompanying drawings, wherein: 

Figure 1 (top panel) shows interaction of full length hSARAl with 
5 bacterial ly expressed Smads. Full length SARA protein was produced in an in 
vitro transcription/translation system in the presence of [ 35 S]methionine and was 
incubated with glutathione-sepharose beads coated with bacterial ly-expressed 
GST fusion proteins of the indicated Smads or Smad2 subdomains. Bound 
material was resolved by SDS-PAGE and visualized by autoradiography. 

1 0 Migration of full length hSARAl , and a translation product that initiates from an 
internal methionine located upstream of the Smad binding domain (asterisk) are 
indicated. The presence of approximately equivalent amounts of GST fusion 
proteins was confirmed by SDS-PAGE and coomassie staining of a protein 
aliquot (bottom panel). 

1 5 Figure 2 shows interaction of hSARA with Smads in mammalian cells. 

COS cells were transfected with Flag-tagged hSARAl (Flag-SARA) either alone or 
together with the indicated Myc-tagged Smad constructs. For Smad6, an 
alternative version lacking the MH1 domain was used (Topper et al., 1997). Cell 
lysates were subjected to an anti-Flag immunoprecipitation and coprecipitating 

20 Smads detected by immunoblotting with anti-Myc antibodies. The migration of 
anti-Flag heavy and light chains (IgG) are marked. To confirm efficient 
expression of hSARAl and the Smads, aliquots of total cell lysates were 
immunoblotted with the anti-Flag and anti-Myc antibodies (bottom panel). The 
migrations of hSARAl and the Smads are indicated. 

25 Figures 3-6 show immunoblots of lysates from COS cells transiently 

transfected with various combinations of Flag or Myc-tagged hSARAl , wild type 
(WT) or mutant (2SA) Myc or Flag-tagged Smad2, Smad4/HA and wild type (WT) 
or constitutively active (A) TpRI/HA, cell lysates being subjected to 
immunoprecipitation with anti-Flag or anti-Myc antibodies, as indicated. 

30 Confirmation of protein expression was performed by immunoblotting total cell 
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lysates prepared in parallel for the indicated tagged protein (totals, bottom 
panels). 

Figure 3: Transfected cells were metabolically labelled with [ 32 P]PO< and 
cell lysates subjected to immunoprecipitation with anti-Flag antibodies for 
visualization of hSARAI phosphorylation (top panel) or with anti-Myc antibodies 
for Smad2 phosphorylation (middle panel). Immunoprecipitates were resolved 
by SDS-PAGE and visualized by autoradiography. The migrations of hSARAI 
and Smad2 are indicated. 

Figure 4: Lysates from transiently transfected COS cells were subjected to 
immunoprecipitation with anti-Flag antibodies and Smad2 bound to hSARAI 
was analyzed by immunoblotting with anti-Myc antibodies (IP: a-flag; blot: a- 
Myc). 

Figure 5: Lysates from transiently transfected COS cells were subjected to 
immunoprecipitation with anti-Flag antibodies and Smad2 bound to hSARAI 
was analyzed by immunoblotting with anti-Myc antibodies (IP: a-flag, blot: a- 
Myc). Partial dissociation of hSARA1/Smad2 complexes induced by TGFp 
signaling was enhanced by expression of Smad4. 

Figure 6: Cell lysates from transiently transfected COS cells were 
subjected to immunoprecipitation with anti-Flag antibodies directed towards 
Smad2. Immunoprecipitates were then immunoblotted using anti-Myc or anti- 
HA antibodies which recognize hSARAI or Smad4, respectively. 
Coprecipitating SARA (a-myc blot) and Smad4 (a-HA blot) are indicated. 

Figure 7, panels A to E, shows photomicrographs of MvlLu cells 
transiently transfected with various combinations of Flag-Smad2, Myc-hSARAI, 
and constitutively active TfiRI (T0RI*) as indicated (Tx). hSARA was visualized 
with the polyclonal Myc A14 antibody and Texas-Red conjugated goat-anti- 
rabbit IgC (red) and Smad2 was detected with an anti-Flag M2 monoclonal 
antibody followed by FITC-conjugated goat anti-mouse IgG (green). The 
subcellular localization of the expressed proteins was visualized by 
immunofluorescence and confocal microscopy. 
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Panels A, B, C, Mvl Lu cells singly transfected with hSARAI (A) or Smacl2 
(B) are shown. Cotransfection of Smad2 with the constitutively active TpRI 
(TPRI*) results in its accumulation in the nucleus (C). 

Panel D, Mv1 Lu cells were transfected with hSARAI and Smad2 and the 
5 localization of hSARAI (red, left photo) and Smad2 (green, centre photo) is 

shown. Colocalization of SARA and Smad2 is shown (right photo) and appears 
as yellow. 

Panel E, Mvl Lu cells were transfected with hSARAI, Smad2 and activated 
TpRI (TpRI*) and the localization of hSARA (red, left photo) and Smad2 (green, 
10 centre photo) is shown. Colocalization of SARA and Smad2 is indicated (right 
photo). Note the shift to an orangy-red colour in the punctate spots and an 
intensification of Smad2 nuclear staining, indicative of dissociation of Srnad2 
from SARA and nuclear translocation. 



1 5 rabbit, polyclonal anti-SARA antibody (left photo, green), goat, polyclonal anti- 
Smad 2/3 antibody (centre photo, red) and with both antibodies (right photo, 
yellow), showing co-localization of hSARAI and Smad2. 

Figure 8A shows photomicrographs of MvlLu cells transfected with either 
hSARAI alone (panel i), TpRII alone (panel ii) or hSARAI and TpRII together 

20 (panel iii), then treated with TGFp and the localization of hSARAI (red) and 

TpRII (green) determined by immunofluorescence and confocal microscopy. In 
cells coexpressing hSARAI and TpRII, superimposing the staining revealed 
colocalization of the proteins as indicated by yellow staining in panel iii. 



25 various combinations of Flag-hSARAI , Myc-Smad2, wild type (WT) TpRII and 
either wild type or kinase-deficient (KR) versions of TpRI. Cells were affinity- 
labelled with [ 125 l]TGFp and lysates immunoprecipitated with anti-Flag 
antibodies. Coprecipitating receptor complexes were visualized by SDS-PAGE 
and autoradiography. Equivalent receptor expression was confirmed by 

30 visualizing aliquots of total cell lysates (bottom panel). 



Figure 7, panel F, shows photomicrographs of Mv1 Lu cells stained with 



Figure 8B shows affinity labelling of COS cells transiently transfected with 



WO 00/05360 




PCT/CA99/00656 



14 

Figure 9A shows COS cells transiently transfected with wild type TpRII 
and kinase-deficient TpRI and various combinations of wild type Flag-hSARAl 
(WT), a mutant version lacking the Smad2 binding domain (ASBD) and Myc- 
Smad2. The amount of receptor bound to SARA was determined by anti-Flag 
5 immunoprecipitation followed by gamma counting. Data is plotted as the 
average of three experiments ±S.D. Protein expression was analyzed by 
immunoblotting aliquots of total cell lysates and the results from a representative 
experiment are shown (bottom panel). 

Figure 9B shows COS cells transiently transfected with wild type TpRII 

1 0 and kinase-deficient TpRI and Flag-tagged wild type (WT) or mutant versions of 
hSARAl with (black bars) or without (open bars) Myc-Smad2. The amount of 
receptor bound to hSARAl was determined by anti-Flag immunoprecipitation 
followed by gamma counting. Protein expression was analyzed by 
immunoblotting aliquots of total cell lysates (bottom panel). 

1 5 Figure 1 0 is a schematic representation of mutant versions of SARA. The 

FYVE domain (shaded bar) and the Smad binding domain, SBD (striped bar), are 
indicated. COS cells transiently transfected with Flag-hSARAl and Myc-Smad2 
were immunoprecipitated with anti-Flag antibodies followed by immunoblotting 
with anti-Myc antibodies. The presence (+) or absence (-) of a hSARA1/Smad2 

20 interaction is indicated (Smad2 interaction). Mutants used for the subsequent 
localization study are marked on the left (i-vi). 

Figure 1 1 A shows an immunoblot of lysates from COS cells expressing 
Flag-tagged Smad2 or Smad3 incubated with GST alone or with GST-hSARAI 
(665-750), which corresponds to the SBD; bound proteins were immunoblotted 

25 using anti-Flag antibodies. The presence of Smad2 and Smad3 bound to GST- 
hSARAI (665-750) is indicated. 

Figure 1 1 B shows an immunoblot of lysates, from COS cells expressing 
Flag-tagged Smad2 together with wild type (WT) or activated (A) type I receptor, 
incubated with GST-hSARAI (665-750) (GST-SBD) and immunoblotted with anti- 

30 Flag antibodies. The expression levels of Smad2, each receptor and GST- 
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hSARAI (665-750) were determined by immunoblotting aliquots of total cell 
I y sates. 

Figure 12 shows the subcellular localization of hSARAI mutants. Mv1 Lu 
cells were transiently transfected with wild type (panel i) or mutant versions of 
5 Flag-hSARA1 (panels ii-viii, as marked on the left in Figure 10). Proteins were 
visualized by immunofluorescence and confocal microscopy using a monoclonal 
anti-Flag M2 monoclonal antibody followed by FITC-conjugated goat anti-mouse 
IgC. 

Figure 13 shows photomicrographs of Mvl Lu cells transiently transfected 
1 0 with mutant versions of Myc-hSARAI and Flag-Smad2 (panel A) or with wild 
type Myc-hSARAI, HA-Smad2 and mutant versions of hSARAI (panel B). 
Protein subcellular localization was visualized by immunofluorescence and 
confocal microscopy. hSARAI was visualized with the polyclonal Myc A14 
antibody and FITC-conjugated goat anti-rabbit IgG (green), while Smad2 was 
1 5 detected with monoclonal antibodies followed by Texas Red-conjugated goat 
anti-mouse IgG (red). In B, overlaying the images reveals mislocalization of 
Smad2 as green speckles of SARA over red, diffuse Smad2 staining (panels ii and 
iii) and colocalization of hSARAI and Smad2 appears as yellow spots (panels i 
and iv). 

20 Figure 14 shows luciferase activity of MvlLu cells transfected with 3TP- 

lux alone or together with the indicated amounts of wild type (WT) or mutant 
(A1-664 or A1-704) versions of hSARAI and incubated in the presence (black 
bars) or absence (open bars) of TGFp. Luciferase activity was normalized to p- 
galactosidase activity and is plotted as the mean ±S.D. of triplicates from a 

25 representative experiment. 

Figure 15 shows luciferase activity of HepG2 cells transfected with ARE- 
Lux alone (v), or ARE-Lux and FAST2 alone or together with the indicated 
amounts of wild type (WT) or mutant versions of hSARAI . Transfected cells 
were incubated in the presence (black bars) or absence (open bars) of TGFp and 

30 luciferase activity was normalized to p-galactosidase activity and is plotted as the 
mean ±S.D. of triplicates from a representative experiment. 
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Figure 16 shows a Northern blot of expression of hSARAI (upper panel) 
and Smad2 (lower panel) in the indicated tissues. 

Figure 17 shows an immunoblot of a HepG2 lysate immunoprecipitated 
(IP) with preimmune serum (PI), anti-hSARA1 polyclonal antibody (SARA) with 
5 and without pretreatment with TGF(3 (- and + ), or N 1 9 anti-Smad2/3 antibody 
(S2), followed by immunoblotting with an anti-Smad2 antibody. The migration 
position of Smad2 is indicated (Smad2). 

Figure 18 shows a diagram of a model of the interaction of a SARA 
protein with a receptor regulated Smad, as exemplified by the interaction of 
10 hSARAI. 

Detailed Description of the Invention 

This invention provides a family of proteins that play key roles in TGF-p, 
activin and bone morphogenetic protein (BMP) signal transduction pathways. In 

1 5 particular, the proteins of this family interact with specific Smad proteins to 

modulate signal transduction. These proteins are therefore designated as "Smad 
Anchor for Receptor Activation" or "SARA" proteins. SARA proteins are 
characterised by three distinct domains (1) a double zinc finger or FYVE domain 
responsible for the subcellular localization of the SARA protein or SARA-Smad 

20 complex, possibly through its association with Ptdlns(3)P, (2) a Smad binding 
domain ("SBD") which mediates the interaction or binding of one or more 
species of Smad protein with the particular member of the SARA family and (3) a 
carboxy terminal domain which mediates interaction of SARA with members of 
the TGFfJ superfamily of receptors. 

25 FYVE domains have been identified in a number of unrelated signaling 

molecules that include FGD1, a putative guanine exchange factor for Rho/Rac 
that is mutated in faciogential dysplasia, the HGF receptor substrate Hrs-1 and its 
homolog Hrs-2, EEA1, a protein involved in formation of the early endosome 
and the yeast proteins FAB1, VPS27 and VAC1 (reviewed in Wiedemann and 

30 Cockcroft, 1 998). Recently, analysis of a number of FYVE domains from yeast 
.and mammals has revealed that this motif binds phosphatidyl inositol-3- 
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phosphate (Ptdlns(3)P) with high specificity and thus represents a novel signaling 
module that can mediate protein interaction with membranes (Burd and Emr, 
1998; Gaullier et al., 1998; Patki et al., 1998; Simonsen et al., 1998; 
Wiedemann and Cockcroft, 1998). Comparison of the FYVE domains from the 
5 vertebrate proteins with that from SARA revealed extensive conservation of 
residues throughout the domain (Table 10). Thus, SARA contains a FYVE 
domain that may function to bind Ptdlns(3)P, which has been implicated in 
intracellular vesicle transport. 

For example, deletion of the FYVE domain in hSARAI causes 
10 mislocalization of Smad2 or Smad3, interferes with TGFp receptor interaction 
and inhibits TGFp-dependent transcriptional responses. 

Thus, the SARA proteins of the invention define a component of TGFp 
superfamily signaling that fulfills an essential role in anchoring receptor 
regulated Smads to specific subcellular domains for activation by a TGFp 
1 5 superfamily receptor. 

Cloned DNA coding sequences and corresponding amino acid sequences 
for representative human and Xenopus SARA protein family members are shown 
in the Tables, as follows: 

Tables 1 and 2 - human SARA1 (hSARAI) cDNA (Sequence ID NO:1) 
20 and amino acid sequence (Sequence ID NO:2) respectively; 

Tables 3 and 4 - human SARA2 (hSARA2) cDNA (Sequence ID NO:3) 
and amino acid sequence (Sequence ID NO:4) respectively; 

Tables 5 and 6 - Xenopus SARA1 (XSARA1) cDNA (Sequence ID NO:5) 
and amino acid sequence (Sequence ID NO:6) respectively; and 
25 Tables 7 and 8 - Xenopus SARA2 (XSARA2) cDNA (Sequence ID NO:7) 

and amino acid sequence (Sequence ID NO:8) respectively. 

Table 9 shows a comparison of the amino acid sequences of XSARA1 and 
hSARAI . Identical residues (dark grey) and conservative changes (light grey), 
the FYVE domain (solid underline) and the Smad binding domain (dashed 
30 underline) are indicated. The sequences in XSARA1 used to design degenerate 
PCR primers for identifying hSARAI are shown (arrows). The amino-terminal 
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end of the partial Xenopus cDNA obtained in the expression screen is marked 
(asterisk). 

The human SARA of Tables 1 and 2, identified as described in Example 2, 
regulates the subcellular localization of Smad2 and Smad3 and recruits these 
Smads into distinct subcellular domains. This SARA also interacts with TGFp 
receptors and TGFp signaling induces dissociation of Smad2 or Smad3 from the 
SARA protein with concomitant formation of Smad2/Smad4 complexes and 
nuclear translocation. 

Table 10 shows alignment of the amino acid sequences of the FYVE 
domains from hSARAI, XSARA1, KIAA0305, FGD1, Hrs-1, Hrs-2 and EEA1. 
Identical residues (dark grey) and conservative changes (light grey) are marked. 
A consensus sequence (bottom) was derived from positions in which at least 6 
out of 7 residues were conserved or when proteins contained one of only two 
alternate residues. 

The regulation of the subcellular localization of components of signaling 
pathways can be key determinants in the effective initiation and maintenance of 
signaling cascades. Targeting of signal transduction proteins to specific 
subcellular regions is highly regulated, often through specific interactions with 
scaffolding or anchoring proteins (Faux and Scott, 1996; Pawson and Scott, 
1997). Scaffolding proteins have been defined as proteins that bind to multiple 
kinases to coordinate the assembly of a cascade, while anchoring proteins are 
tethered to specific subcellular regions in the cell and can act to bring together 
components of a pathway. Regulating location of signaling components can 
thus coordinate the activity of a signaling network, maintain signaling specificity 
or facilitate activation of a pathway by localizing kinases together with their 
downstream substrates. 

As described herein, a recombinantly produced human SARA protein 
bound directly and specifically to unphosphorylated Smad2 and Smad3. In 
addition, receptor-dependent phosphorylation induced Smad2 to dissociate from 
SARA, bind to Smad4 and translocate to the nucleus. Thus, the hSARAI protein 
. functions in TGFp signaling upstream of Smad activation to recruit Smad2 to the 
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TGFp receptor by mediating the specific subcellular localization of Smad and by 
associating with the TGFp receptor complex. Furthermore, inducing 
mislocalization of Smad2 by expressing a mutant of the hSARAI protein blocks 
TGFp-dependent transcriptional responses, indicating an essential role for SARA- 
5 mediated localization of Smads in signaling. Together, these results identify the 
cloned hSARAI protein as a novel component of the TGFp pathway that 
functions to anchor Smad2 to specific subcellular sites for activation by the TGFp 
receptor kinase. 



10 and are phosphorylated on the C-terminal SSXS motif (Abdollah et aL, 1997; 

Kretzschmar et al., 1997; Macias-Silva et al., 1996; Souchelnytskyi et al., 1997). 
This phosphorylation is similar to receptor-dependent phosphorylation in 
mammalian cells, suggesting that SARA is not absolutely required for recognition 
of Smads by the receptor complex. In intact cells, however, receptor-regulated 

1 5 Smads are cytosolic proteins that require activation by transmembrane 
serine/threonine kinase receptors. Consequently, Smads may require 
recruitment by SARA to interact with TGFp superfamily receptors. Domains in 
which SARA is found correspond to regions where TGFp receptors are also 
localised. TGFp receptors display regionalized localization and hSARAI recruits 

20 Smad2 to these domains. The identity of these intracellular domains is unclear. 
However, they contain receptors and recent evidence has shown that FYVE 
finger domains interact with membranes, so it is reasonable to suggest that these 
domains represent membrane vesicles. Thus, clustering of the TGFp receptor, as 
previously described by Henis et al. (1994), may function to direct the receptor 

25 to hSARAI and the Smad2 substrate. This activity may be most critical in vivo, 
where ser/thr kinase receptors are often found in low numbers and only a small 
proportion need to be activated for biological responses (Dyson and Gurdoh, 
1998). This activity is likely to be most critical in v/vo, where ser/thr kinase 
receptors are often found in low numbers and only a small proportion need to be 

30 activated for biological responses (Dyson and Gurdon, 1998). This may impose 



In vitro, receptor-regulated Smads are recognized by the receptor kinases 
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on the pathway a stringent requirement for SARA to anchor Smads in these sites 
for receptor interaction. 

The colocalization and association of hSARAI with the TGF0 receptor 
defines a role for hSARAI in recruiting Smad2 to the receptor kinase. 
Furthermore, deletion of the FYVE domain interferes with receptor binding, 
prevents the correct localization of hSARA1/Smad2 and blocks TGFp signaling 
(see Example 8 below), suggesting that this is an important function in the 
pathway. Interestingly, the binding of the hSARAI protein identified in Example 
2 to the receptor was enhanced upon Smad2 expression and, on its own, SARA 
may interact inefficiently with the receptor. However, within the hSARA1/Smad 
complex, Smad2 might help drive association with the receptor through its 
recognition of the catalytic region of the kinase domain. Consistent with this, 
cooperation requires a kinase deficient type I receptor which also traps the 
Smad2 substrate (Macfas-Silva et al., 1996). Thus, Smad2 may bind to the 
catalytic pocket of the type I kinase domain while hSARAI, which is not a 
substrate of the kinase, may interact with regions outside of the domain. 

The human SARA protein identified in Example 2 did not interact with 
any of the other Smads tested, indicating that it functions specifically in Smad2 
and Smad3 pathways (see Example 3). However, Smad5 localization in 293 
cells displayed a remarkably similar pattern to that of this SARA protein 
(Nishimura et al., 1998) and similar patterns were observed for endogenous 
Smadl or 5 in the kidney epithelial cell line, IMCD-3. Thus, localization of 
BMP-regulated Smads (for example, Smadl, Smad5 and Smad8) may also be 
regulated by a specific SARA family member. 

The genes for two other SARA family member proteins were also 
identified and cloned. One of these, identified in Xenopus and designated 
XSARA2 (Tables 7 and 8), is related to XSARA 7, while the other one, hSARA2 
(Tables 3 and 4), is a human clone, related to the hSARA 1 of Tables 1 and 2. 
This second human clone has been identified in EST clone KIAA0305. A 
comparison of the SBD from hSARAI with a similar region from the KIAA0305 
sequence indicated that the amino terminal half of the region of the SBD was 
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highly divergent from the amino acid sequence encoded by KIAA0305. This 
suggests that the protein encoded by KIAA0305 may mediate binding with other 
as yet unidentified proteins, eg. other Smads. In contrast to the SBD, the FYVE 
domain of the KIAA0305 protein is more closely related to the hSARAI FYVE 
5 domain (70% identity), suggesting that this protein may be an anchor for other 
Smad proteins that function either in the TGFp pathway or in other signaling 
cascades, such as the BMP signal transduction pathway. 

SARA is not limiting in Smad activation and TGFp superfamily signaling 

10 It was observed that elevating Smad2 levels can saturate hSARAI and 

yield a diffuse distribution for Smad2. Thus, the level of the hSARAI protein is a 
key determinant in controlling Smad2 localization. As a consequence, 
endogenous Smad2 may or may not display a hSARAI -like distribution, 
depending on the relative expression of the two proteins. Indeed, in Mvl Lu 
1 5 cells, endogenous Smad2 displays a punctate pattern with some diffuse staining 
in the cytosol. While not meaning to limit the invention to a particular 
mechanism, the data are consistent with the view that once signaling has 
commenced, Smad2 dissociates from hSARAI, binds to Smad4 and translocates 
to the nucleus, freeing hSARAI to recruit additional Smad2 from the cytosolic 
20 reservoir. This would provide a mechanism to allow quantitative activation of 
Smads in the presence of high levels of TGFp signaling. 

By functioning to recruit Smad2 to the TGFp receptor, hSARAI is located 
in an important regulatory position in the pathway. Thus, control of hSARAI 
localization or protein levels, or its interaction with Smad2, could modulate 
25 TGFp signaling. Further, disruption of normal hSARAI function could 

potentially be involved in loss of TGFp responsiveness that is a common feature 
during tumour progression. 

Modular Domains in SARA 

30 The function of hSARA! in TGFp signaling is mediated by three 

independent domains, the Smad binding domain (SBD) that mediates specific 
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interaction with Smad2 and Smad3, the FYVE domain that targets 
hSARA1/Smad2 to specific subcellular sites and the carboxy terminus which 
mediates association with the TGFB receptor. The Xenopus and mouse 
forkhead-containing DNA binding proteins, FAST1 and FAST2, bind specifically 
5 to Smad2 and Smad3 and like hSARAI, interact with the MH2 domains (Chen et 
al., 1996; Chen etal., 1997a; Labbe et al., 1998; Liu et al., 1997a). 
Comparison of the SBD from this SARA with the Smad Interaction Domain (SID) 
from these FAST proteins revealed no regions of obvious similarity. However, 
since hSARAI acts upstream and FAST downstream of Smad activation, these 

1 0 proteins may employ structurally unrelated domains to distinguish unactivated 
versus activated forms of Smad2. Thus, the SBD of this SARA protein 
preferentially binds unphosphorylated monomeric Smad2 while the SID from 
FAST must bind phosphorylated Smad2 in heteromeric complexes with Smad4. 
By analogy, the SBD of other SARA family members may bind the 

1 5 unphosphorylated monomeric species of other Smads that mediate signal 
transduction in other pathways (eg. Smads 1, 5 or 8 in the BMP signal 
transduction pathway). 

In hSARAI, the FYVE domain functions independently of the SBD, to 
mediate the subcellular targetting of the protein. The FYVE-finger motif has now 

20 been identified in at least 30 proteins from diverse species, such as FGD1, Hrs-1 
and 2, and EEA1 (Gaullier et al., 1998; Wiedemann and Cockcroft, 1998). 
Recent advances have demonstrated that FYVE finger motifs from a variety of 
divergent proteins have a conserved function and bind phosphatidyl inositol-3- 
phosphate (Ptdlns(3)P) with high specificity (Burd et al., (1998); Patki (1998); 

25 Gaullier (1 998)). Through this interaction, the FYVE finger can mediate protein 
interactions with phospholipid bilayers. However, Ptdlns(3)P is present 
ubiquitously on cell membranes and in the case of EEA1, further protein-protein 
interactions with Rab5-GTP are required in addition to the FYVE domain to 
target the protein to the correct membranes (Simonsen et al., 1998). Given that 

30 Ptdlns(3)P binding by FYVE fingers is conserved in yeast and mammals, it is 
likely that the FYVE finger of hSARAI similarly mediates interaction with the 
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membrane. Furthermore, it is possible that additional protein-protein 
interactions may be required to direct hSARAI to regions that contain the TGFp 
receptors. The carboxy terminus of hSARAI, which is required for efficient 
interaction with the TGFp receptor, may function in this capacity. 
5 Together, these data define discrete domains in SARA that fulfill specific 

aspects of SARA function in TGFp superfamily signaling. Without being limited 
to any particular mechanism, a possible model of the interaction of SARA with a 
receptor regulated Smad in TGFp superfamily signaling, as exemplified by 
hSARAI and its interactions with Smad2 in TGFp signaling, is shown 

10 diagrammatically in Figure 18. The FYVE domain likely functions to direct SARA 
to the membrane, perhaps through interactions with Ptlns(3)P. It thus fulfills an 
important role in recruiting hSARAI to specific subcellular domains that have 
been shown also to contain the TGFp receptor. The SBD in turn functions to 
bind unactivated Smad2, thus recruiting the receptor substrate to this subcellular 

15 region. Once localized to this region, the C-terminal domain of hSARAI 

functions with Smad2 bound to the SBD to promote interaction with the receptor 
complex. These three domains thus function cooperatively to recruit Smad2 to 
the TGFP receptor. 



20 Additional Roles for SARA 

Controlling the localization of kinases and their substrates may allow not 
only for efficient recognition and phosphorylation but may also function to 
maintain specificity and suppress crosstalk between signaling pathways. Thus, 
by controlling Smad localization, a SARA family member protein could 

25 additionally function to maintain the highly specific regulation of Smad 

phosphorylation by ser/thr kinase receptors that is observed in vivo and could 
prevent promiscuous phosphorylation by other kinases in the cell. Furthermore, 
through its interactions with a particular receptor, a SARA protein might function 
to control the activity or turnover of the receptor complex. Alternatively, SARA 

30 may also fulfill scaffolding functions to coordinate the receptor-dependent 
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activation of Smads with other as yet unidentified components of a signalin 
pathway. 



Nucleic Acids 

5 In accordance with one series of embodiments, the present invention 

provides isolated nucleic acids corresponding to, or related to, the human and 
Xenopus SARA nucleic acid sequences disclosed herein. In addition to the 
SARA nucleotide sequences disclosed herein, one of ordinary skill in the art is 
now enabled to identify and isolate homologues of the SARA genes described 

1 0 herein. One of ordinary skill in the art may screen preparations of genomic or 
cDNA from other species using probes or PCR primers derived from nucleotide 
sequences disclosed herein. In accordance with a further embodiment, the 
invention provides isolated nucleic acids of at least 10 consecutive nucleotides, 
preferably 15 consecutive nucleotides, more preferably 20 consecutive 

1 5 nucleotides of Sequences ID NO:!, Sequence ID NO:3, Sequence ID NO:5 and 
Sequence ID NO:7, up to the complete sequences. Short stretches of nucleotide 
sequence are useful as probes or primers useful for identification or amplification 
of the nucleic acids of the invention or for encoding fragments, functional 
domains or antigenic determinants of SARA proteins. 
20 The invention also includes polynucleotides which are complementary to 

the disclosed sequences, polynucleotides which hybridise to these sequences at 
high stringency and degeneracy equivalents of these sequences. 

Proteins 

25 SARA proteins may be produced by culturing a host cell transformed with 

a DNA sequence encoding a selected SARA protein. The DNA sequence is 
operatively linked to an expression control sequence in a recombinant vector so 
that the protein may be expressed. 

Host cells which may be transfected with the vectors of the invention 

30 may be selected from the group consisting of E. coli, Pseudomonas, Bacillus 
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subtillus, or other bacilli, yeasts, fungi, insect cells or mammalian cells including 
human cells. 

For transformation of a mammalian cell for expression of a SARA protein, 
the vector may be delivered to the cells by a suitable vehicle. Such vehicles 
5 including vaccinia virus, adenovirus, retrovirus, Herpes simplex virus and other 
vector systems known to those of skill in the art. 

A SARA protein may also be recombinantly expressed as a fusion protein. 
For example, the SARA cDNA sequence is inserted into a vector which contains 
a nucleotide sequence encoding another peptide (e.g. GST-glutathione succinyl 
10 transferase). The fusion protein is expressed and recovered from prokaryotic 

(e.g. bacterial or baculovirus) or eukaryotic cells. The fusion protein can then be 
purified by affinity chromatography based upon the fusion vector sequence and 
the SARA protein obtained by enzymatic cleavage of the fusion protein. 

The protein may also be produced by conventional chemical synthetic 
1 5 methods, as understood by those skilled in the art. 

SARA proteins may also be isolated from cells or tissues, including 
mammalian cells or tissues, in which the protein is normally expressed. 

The protein may be purified by conventional purification methods known 
to those in the art, such as chromatography methods, high performance liquid 
20 chromatography methods or precipitation. 

For example, anti-SARA antibodies may be used to isolate SARA protein 
which is then purified by standard methods. 



Antibodies 

25 The provision of the polynucleotide and amino acid sequences of SARA 

proteins provides for the production of antibodies which bind selectively to a 
SARA protein or to fragments thereof. The term "antibodies" includes 
polyclonal antibodies, monoclonal antibodies, single chain antibodies and 
fragments thereof such as Fab fragments. 

30 In order to prepare polyclonal antibodies, fusion proteins containing 

defined portions or all of a SARA protein can be synthesized in bacteria by 
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expression of the corresponding DNA sequences, as described above. Fusion 
proteins are commonly used as a source of antigen for producing antibodies. 
Alternatively, the protein may be isolated and purified from the recombinant 
expression culture and used as source of antigen. Either the entire protein or 
fragments thereof can be used as a source of antigen to produce antibodies. 

The purified protein is mixed with Freund's adjuvant and injected into 
rabbits or other appropriate laboratory animals. Following booster injections at 
weekly intervals, the animals are then bled and the serum isolated. The serum 
may be used directly or purified by various methods including affinity 
chromatography to give polyclonal antibodies. 

Alternatively, synthetic peptides can be made corresponding to antigenic 
portions of a SARA protein and these may be used to inoculate the animals. 

In a further embodiment, monoclonal anti-SARA antibodies may be 
produced by methods well known in the art. Briefly, the purified protein or 
fragment thereof is injected in Freund's adjuvant into mice over a suitable period 
of time, spleen cells are harvested and these are fused with a permanently 
growing myeloma partner and the resultant hybridomas are screened to identify 
cells producing the desired antibody. Suitable methods for antibody preparation 
may be found in standard texts such as Antibody Engineering, 2d. edition, 
Barreback, ED., Oxford University Press, (1995). 

Transgenic animals 

In accordance with a further embodiment, the invention provides for the 
production of transgenic non-human animals which afford models for further 
study of the SARA family of proteins and also provide tools for the screening of 
the candidate compounds as therapeutics. 

Animal species which are suitable for use include rats, mice, hamsters, 
guinea pigs, rabbits, dogs, cats, goats, sheep, pigs and non-human primates. 

In accordance with one embodiment, a transgenic animal may be 
prepared carrying a heterologous SARA gene by inserting the gene into a germ 
.line or stem cell using standard technique of oocyte microinjection, or 
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transfection or microinjection into embryonic stem cells. The techniques of 
generating transgenic animals are now well known and fully described in the 
literature. For example, a laboratory manual in the manipulation of the mouse 
embryo describes standard laboratory techniques for the production of 
5 transgenic mice (Hogan et al. (1986), Manipulating the Mouse Embryo, Cold 
Spring Harbour Laboratory Press, Cold Spring Harbour, New York). 

In accordance with a further embodiment, the invention enables the 
inactivation or replacement of an endogenous SARA gene in an animal by 
homologous recombination. Such techniques are also fully described in the 
10 literature. Such techniques produce "knock-out" animals, with an inactivated 
gene, or "knock-in" animals, with a replaced gene. 

EXAMPLES 

The examples are described for the purposes of illustration and are not 
1 5 intended to limit in any way the scope of the invention. 

Methods of molecular genetics, protein and peptide biochemistry and 
immunology referred to but not explicitly described in this disclosure and 
examples are reported in the scientific literature and are well known to those 
skilled in the art. 

20 

Example 1: Methods 

Isolation of Xenopus and human SARA 

To prepare a probe for library screening, the MH2 domain of Smad2 
(amino acids 241-467) was subcloned into a modified pGEX4T-1 vector 

25 containing the protein kinase A recognition site derived from pGEX2TK 

(Pharmacia). This bacterial fusion protein was purified, labelled with [ 32 P]yATP 
and used as probe to screen a XZAP II Xenopus dorsal lip library as described 
(Chen and Sudol, 1995). A screen of 1 x 10 6 plaques yielded four phage which 
represented repeated isolates of the same clone. This partial cDNA contained a 

30 2.1 kb open reading frame and 1 kb of 3' untranslated region (UTR). A full 

length clone was obtained by a combination of rescreening of the same dorsal 
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lip library using a 670 base pair EcoRI/Hpal fragment at the 5' end of this clone 
and by 5' RACE (Gibco/BRL) using stage 10 Xenopus RNA. 

To obtain a human homolog of Xenopus SARA, cDNA was synthesized 
from randomly primed total RNA isolated from HepG2 cells. This cDNA was 
5 subjected to polymerase chain reaction (PCR) using degenerate primers as 

described previously (Attisano et al., 1992). The 5' and 3' primers, designed to 
encode the zinc-finger motif, correspond to 

G C(A/C/G/T/)CC( A/C/G/T) AA(C/T)TG (C/TATG AA(A/C/G/T)TG (OT) a nd 
(A/G)CA(A/G)TA(C/T)TC(A/CyG/T)GC(A/C/G/T)GG(A/G)TT(A/G)TT, respectively. 

10 A 1 50 base pair PCR product was sequenced and then used as probe for 

screening a XZAP human fetal brain cDNA library (Stratagene). Eight positive 
plaques were obtained, two of which contained an overlap of approximately Ikb 
and covered the entire open reading frame. The sequence of the 5' UTR was 
confirmed by sequencing of an expressed sequence tag database clone (clone ID 

1 5 260739). 



Construction of Plasmids 

For mammalian expression constructs of SARA, the open reading frame of 
hSARA was amplified by PCR and was subcloned into pCMV5 in frame with an 

20 amino-terminal Flag or Myc tag (Hoodless et al., 1 996). The deletion mutants of 
pCMV5-Flag-hSaraA893-1323, A346-132, A893-1323, and A346-1323 were 
constructed by deletion of EcoRV-Hindlll, Xbal-Hindlll, Sall-EcoRV, and Sall-Xbal 
fragments, respectively. PCMV5-Flag-hSaraA1-594 and A1-686 were obtained 
by partial digestion with Asp718/Sall and for pCMV5-Flag-hSARA A665-1323 a 

25 Asp718/Hindlll partial digest was used. PCMV5-Flag-hSARAA596-704 was 
constructed by deleting Asp71 8 fragment. The other hSARA mutants were 
constructed by PCR using appropriate primers. PCMV5B-Myc-Smad3 and Myc- 
Smad6, pGEX4T-1-Smad2/MH1 (amino acids 1-181), pGEX4T-1-Smad2/linker 
(amino acids 186-273), pGEX4T-1-Smad2/MH2 (amino acids 241-467) and 

30 pGEX4T-1-h SARA (amino acids 665-750) were constructed by PCR. 
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In Vitro Protein Interactions 

In vitro transcription/translation reactions were performed using the TNT 
coupled reticulocyte lysate system (Promega) following the manufacturer's 
instructions using T3 RNA polymerase. Translation was carried out in the 
5 presence of [ 35 S]-methionine and labelled proteins were incubated with purified 
GST fusion proteins in TNTE buffer with 10% glycerol for 2 hours at 4°C and 
then washed five times with the same buffer. Bound protein was separated by 
SDS-PAGE and visualized by autoradiography. 



1 0 Immunoprecipitation and Immunoblotting 

COS-1 cells transfected with LipofectAMINE (GIBCO BRL) were lysed 
with lysis buffer (Wrana et aL, 1994) and subjected to immunoprecipitation with 
either anti-Flag M2 (IBI, Eastern Kodak) or anti-Myc (9E10) monoclonal antibody 
followed by adsorption to protein-G sepharose. Precipitates were separated by 
1 5 SDS-PAGE, transferred to nitrocellulose membranes and immunoblotted as 
described previously (Hoodless et aL, 1996). 



Af f i n i ty- Label ling 

LipofectAMINE transfected COS-1 cells were incubated with 200 pM 
20 [ 125 l]TGFp in media containing 0.2% bovine fetal serum at 37°C for 30 minutes 
and receptors were cross-linked to ligand with DSS as described previously 
(Macias-Silva et aL, 1996). Cell lysates were immunoprecipitated with anti-Flag 
antibody and receptors visualized by SDS-PAGE and autoradiography. In some 
cases, cross-linked [ 125 l]TGFp was determined by gamma counting. 

25 

Subcellular Localization by Immunofiuorescent Confocal Microscopy 

Mv1 Lu cells, plated on gelatin-coated Permanox chamber slides (Nunc), 
were transfected by the calcium phosphate-DNA precipitation method. Fixation, 
permeabilisation and reaction with the primary and secondary antibodies were 
30 described previously (Hoodless et aL, 1996). Monoclonal anti-Flag antibodies 
were visualized by FITC-conjugated goat anti-mouse IgG (Jackson Laboratories) 
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and polyclonal Myc antibody (A14, Santa Cruz) was visualized with Texas-Red- 
conjugated goat anti-rabbit IgG (Jackson Laboratories). Immunofluorescence was 
analyzed on a Leica confocal microscope. 

Transcriptional Response Assay 

Mv1 Lu cells were transiently transfected with the reporter plasmid, p3TP- 
lux (Wrana et al., 1992), CMV-Bgal and selected constructs using calcium 
phosphate transfection. Twenty-four hours after transfection, cells were 
incubated overnight with or without 50 pM TGFp. Luciferase activity was 
measured using the luciferase assay system (Promega) in a Berthold Lumat LB 
9501 luminometer and was normalized to B-galactosidase activity. 

Example 2 - Identific ation of SARA family members 

The MH2 domain of Smad2 was fused to glutathione-S-transferase (GST) 
that included a kinase recognition site for protein kinase A (PKA). The 
bacterial ly-expressed fusion protein was labelled to high specific activity using 
PKA (Chen and Sudol, 1995), and then used to screen a TiZAPII expression 
library prepared from the dorsal blastopore lip of Xenopus. From this screen, 
four clones were identified, all of which presented a repeated isolate of a partial 
cDNA clone with no similarity to sequences in the GenBank database. To 
confirm that the product encoded by this clone interacted with Smad2, an in 
vitro transcription/translation system was used to produce [ 35 S]methionine- 
labelled protein. Translation of the cDNA yielded a protein product of 
approximately 80 kDa which corresponded in size to the longest open reading 
frame (ORF) identified in the sequence. Incubation of this product with 
bacterial ly-produced GST-Smad2(MH2) resulted in efficient binding of the 
translated product to the fusion protein (data not shown). Interaction with full 
length Smad2 was also observed, whereas binding to bacteria I ly-expressed 
Smadl or Smad4 was not. 

To isolate a full length cDNA, the partial clone identified in the 
interaction screen was used as a probe to rescreen the same blastopore lip 



WO 00/05360 




PCT/CA99/00656 



31 



library. Since the resulting clones lacked the 5' end, 5' RACE was conducted to 
obtain the entire coding sequence. Analysis of the complete cDNA sequence 
(Table 5) revealed a long open reading frame that was contiguous with that of 
the partial clone. The predicted protein, XSARA1, is 1235 amino acids long with 
5 an estimated molecular mass of 135 kDa (Table 6). Analysis of the full length 
cDNA sequence (Table 9) revealed a region in the middle portion of the 
predicted protein that had similarity to a double zinc finger domain (recently 
renamed the FYVE domain; Mu et al., 1995). The FYVE domain has been 
identified in a number of unrelated signaling molecules that include FGD1, a 

10 putative guanine exchange factor for Rho/Rac that is mutated in faciogenital 
dysplasia (Pasteris et al., 1994), the HGF receptor substrate Hrs-1 and its 
homolog Hrs-2 (Bean et al., 1997; Komada and Kitamura, 1995), EEA1, a protein 
involved in formation of the early endosome (Mu et al., 1995) and the yeast 
proteins FAB1, VPS27 and VAC1 (Piper et al., 1995; Weisman and Wickner, 

1 5 1 992; Yamamoto et al., 1 995). Comparison of the FYVE domains from the 
vertebrate proteins with that from SARA revealed extensive conservation of 
residues throughout the domain (Table 10). Thus, SARA contains a FYVE 
domain that may fulfill important functions in diverse proteins. 



20 mammalian cells, a human homologue was identified. Using a carboxy-terminal 
portion of XSARA1, a human library was screened and a protein was identified 
that was distantly related to Xenopus SARA (34% identity) and which was also 
sequenced as an EST (KIAA0305). However, no homologs closer to XSARA 
were identified. Thus, degenerate oligonucleotide primers were designed 

25 encoding amino acids in XSARA 1 (Table 9) and HepG2 RNA was used as 

template for degenerate PCR. A related sequence was identified and this partial 
cDNA was used to screen a human brain cDNA library. Four overlapping 
clones, encoding a long open reading frame were identified and a search of the 
EST database with this sequence led to the identification of additional 

30 overlapping cDNA clones from libraries derived from T cells, uterus, endothelial 
cells and melanocytes. Analysis of the contiguous sequence revealed a long 
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open reading frame that had a consensus start codon preceded by stop codons in 
all three reading frames (Table 1). Comparison of the predicted protein hSARAI 
(Table 2), from this cDNA with XSARA1 (Table 9) revealed an overall identity of 
62%, with a divergent 558 residue amino terminal domain (35% identity) 
followed by a closely related carboxy terminus (85% identity). 

Example 3 - hSARA interacts s pecifically with Smari2 and Smad3 

To characterize the interaction of hSARA with Smads, the full length 
protein was translated in vitro and tested for binding to bacterial ly-expressed 
Smad fusion proteins. Similar to the Xenopus clone, hSARAI bound specifically 
to full length Smad2, but not Smadl or Smad4 (Figure 1). In addition, full length 
Smad3, which is highly related to Smad2, also interacted with hSARAI. To 
define the domains of Smad2 that bound hSARA, in bacteria various fragments of 
Smad2 corresponding to the MH1 domain, linker region and MH2 domain were 
expressed in bacteria. Similar to the Xenopus clone, hSARA interacted efficiently 
with fusion proteins that comprised the MH2 domain, while no association was 
detected between hSARA and either the MH1 or non-conserved linker domains 
(Figure 1). Thus, hSARAI specifically interacts with Smad2 through the MH2 
domain. 

To confirm that hSARA also bound to Smads in mammalian cells, a Flag 
epitope tag was introduced at the amino terminus of the protein to create Flag- 
SARA. Transient expression of Flag- SARA in COS cells yielded a protein of the 
predicted molecular weight for SARA (Figure 2) that was not present in 
untransfected cells (data not shown). To investigate the interaction of SARA with 
Smads, Flag- SARA was expressed in COS cells together with Myc-tagged 
versions of Smads 1 , 2, 3, 4, 6 and 7. Cell lysates were subjected to anti-Flag 
immunoprecipitation followed by immunoblotting with anti-Myc antibodies. In 
other immunoprecipitates of cells expressing either Smad2 or Smad3, efficient 
coprecipitation of either Smad with Flag- hSARAI was observed (Figure 2). In 
contrast, none of the other Smads coprecipitated with hSARAI . Specific binding 
of this SARA family member to both Smad2 and Smad3 is consistent with the 
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observation that these two proteins possess very closely related MH2 domains 
(97% identity) and are both activated by TGFp or activin type I receptors (Liu et 
a., 1997b; Macias-Silva et al., 1996; Nakao et al., 1997a). Together, these 
results demonstrate that this SARA family member is a specific partner for 
5 receptor-regulated Smads of the TGFp/activin signaling pathway. 

Example 4- Phosphorylation of Smad2 induces dissociation from SARA 

Previous findings have shown that activation of TGFp signaling results in 
phosphorylation of Smad2 or Smad3 by type I receptors on C-terminal serine 

10 residues (Liu et al., 1997b; Macias-Silva et al., 1996). A constitutively active 

TGFp type I receptor was prepared by substituting a threonine in the GS domain 
with an aspartate residue (Wieser et al., 1995). This activated type I receptor 
induces TGFP signaling in the absence of type II receptors and ligand and 
regulates the phosphorylation and activation of Smad proteins in a manner 

15 similar to ligand (Macias-Silva et al., 1996; Wieser etal., 1995). COS cells were 
transfected with combinations of Smad2, hSARAI or both in the presence or 
absence of activated TpRI. Cells were then metabolically labelled with 
[ 32 P]phosphate and phosphorylation of either hSARAI or Smad2 was assessed in 
immunoprecipitates. Analysis of SARA phosphorylation revealed that the protein 

20 was basal ly phosphorylated and the coexpression of the activated type I receptor 
did not appreciably affect the overall phosphorylation (Figure 3). In contrast, 
analysis of Smad2 immunoprecipitated from total cell lysates showed that the 
activated type I receptor induced strong phosphorylation of the protein as 
described previously (Macias-Silva et al., 1996). These results suggest that SARA 

25 is not phosphorylated in response to TGFp signaling. 

The phosphorylation state of Smad2 that coprecipitated with hSARAI was 
examined. Interestingly, unlike the strong induction of Smad2 phosphorylation 
in the total cellular pool, phosphorylation of Smad2 associated with hSARAI was 
not enhanced, but rather appeared to decrease in the presence of TGFp signaling 
30 (Figure 3). This suggested that receptor-dependent phosphorylation of Smad2 
.might induce dissociation from hSARAI . To examine this directly, the 
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interaction of hSARAI with wild type Smad 2 or a mutant version lacking the C- 
terminal phosphorylation sites (Smad2(2SA)) was analysed. In the absence of 
TGFp signaling, association of hSARAI with either Smad2 or Smad2(2SA) was 
comparable (Figure 4). In contrast, in cells coexpressing the activated receptor, a 
significant decrease in the interaction of wild type Smad2 with hSARAI was 
observed. However, hSARA1/Smad2(2SA) complexes were not reduced by the 
activated receptor. Together, these results suggest that hSARAI is not 
phosphorylated in response to TGF0 signaling and that it preferentially interacts 
with the unphosphorylated form of Smad2. 

Example 5 - SARA and Smad4 form m utually elusive comnlgxes with Smad2 
Phosphorylation of Smad2 induces its interaction with Smad4 (Lagna et 
al., 1996; Zhang et al., 1997). hSARAl/Smad2 complexes in COS cells 
coexpressing Smad4 were assessed. In unstimulated cells, the level of 
hSARAI /Smad2 complex formation was comparable either in the presence or 
absence of Smad4 (Figure 5, lanes 3 and 6). However, upon activation of TGF{3 
signaling, dissociation of Smad2 from hSARAI was significantly enhanced by 
coexpression of Smad4 (Figure 5, lanes 4 and 7). These results indicated that 
phosphorylated Smad2 might preferentially interact with Smad4 rather than 
hSARAI and suggested that Smad2 might form mutually exclusive complexes 
with either Smad4 or hSARAI . The formation of Smad2/Smad4 and 
Smad2/hSARA4 complexes in the same transfectants was then examined. Cell 
lysates were subjected to immunoprecipitation with anti-Flag antibodies directed 
towards tagged Smad2 and then immunoblotted for the presence of Smad4 and 
hSARAI . Consistent with previous findings (Lagna et al., 1996; Zhang et al., 
1997), interaction of Smad4 with Smad2 was strongly stimulated by the activated 
type I receptor (Figure 6, lane 3 and 4). Concomitant with the formation of 
Smad2/Smad4 complexes, the interaction of Smad2 with hSARAI was disrupted 
by activation of signaling (Figure 6, lanes 6 and 7). Thus, complexes of 
Smad2/hSARA1 and Smad2/Smad4 are mutually exclusive, supporting the notion 
that Smad4 may compete for Smad2 to enhance dissociation of hSARAI /Smad2 
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complexes. Together these results demonstrate that during TGFp signaling, 
hSARA1/Smad2 complexes are transient and phosphorylation of Smad2 induces 
dissociation and formation of heteromeric complexes with Smad4. 



5 Example 6- hSARAI regulates the subcellular localization of Smad2 

The studies described above suggest that SARA functions upstream in the 
pathway and might control the subcellular localization of Smad2. To test this, 
an investigation was done to determine whether coexpression of hSARAI might 
alter the localization of Smad2 in the TGFp-responsive epithelial cell line, 
10 Mv1 Lu, using confocal microscopy. Mv1 Lu cells were used rather than COS 

since the Myc antibodies crossreacted with endogenous proteins in the COS and 
obscured nuclear staining of tagged proteins. In cells expressing hSARAI alone, 
the protein displayed a punctate staining pattern that was present throughout the 
cytosolic compartment and was excluded from the nucleus (Figure 7A). This 
1 5 localization of hSARAI was in contrast to the diffuse staining typically observed 
for Smad2 in cells overexpressing the protein (Figure 7B). Cells transiently 
transfected with both hSARAI and Smad2 were examined. In these cells, the 
distribution of hSARAI was indistinguishable from cells transfected with hSARAI 
alone (Figure 7D, left photo). In contrast, the localization of Smad2 in the 
20 presence of hSARAI displayed a dramatic shift to a punctate pattern (compare 

Figure 7B to 7D, centre photos). Moreover, analysis of these immunofluorescent 
staining patterns by confocal microscopy revealed that hSARAI and Smad2 
precisely colocalized in the cytosol (yellow stain, Figure 7D, right photo). 
Interestingly, expression of Smad2 at much higher levels than hSARAI reverted 
25 the distribution of Smad2 to that observed in cells transfected with Smad2 alone 
(data not shown). This supports the notion that elevating the amount of Smad2 
can saturate hSARAI and yield a diffuse distribution of Smad2 throughout the 
cell. 

Studies were conducted to determine whether activation of TGFp 
30 signaling induces nuclear translocation of Smad2 in the presence of hSARAI . As 
shown in Figure 7, the localization of hSARAI in the cytosolic compartment 
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looked similar in the presence or absence of the constitutively active TGFp type I 
receptor (compare Figure 7D and E, left photos). However, TGFp signaling 
caused a significant proportion of Smad2 to translocate to the nucleus (Figure 7E, 
centre photo) and this correlated with a shift to an orangy-red colour in the 
5 cytosolic colocalization stain (Figure 7E, right photo). Thus activation of TGFp 
signaling induces Smad2 to dissociate from hSARAI and translocate to the 
nucleus. 

To confirm that the punctate localization of overexpressed SARA reflected 
that of the endogenous protein, the localization of endogenous SARA and Smad2 

1 0 was examined in Mv1 Lu cells. Analysis of the distribution of endogenous 
hSARAI using affinity-purified rabbit anti-hSARA1 antibodies revealed a 
punctate distribution that was similar to the pattern observed for transiently 
transfected, epitope-tagged hSARAI (Figure 7F, left photo). This staining was 
specific, since cells stained with preimmune antisera, or purified antibody 

1 5 blocked with the hSARAI antigen, revealed no detectable staining in the cytosol, 
although some weak background staining was observed in the nucleus (data not 
shown). Examination of endogenous Smad2 distribution in the same cell using 
goat anti-Smad2 antibodies revealed a punctate distribution for Smad2 (Figure 
7F, centre photo) as published previously (Janknecht et al., 1998). Furthermore, 

20 analysis of hSARAI and Smad2 together revealed extensive colocalization of the 
two proteins (Figure 7F, right photo). Colocalization was not complete and may 
reflect differences in the stoichiometry of hSARAI versus Smad2 protein levels as 
suggested above, or the presence of additional regulatory mechanisms in the cell 
that control interaction of the endogenous proteins. 

25 Taken together with the biochemical analysis, these results indicate that 

hSARAI functions to anchor or recruit Smad2 to specific subcellular regions 
prior to activation by TGFp signaling. 



30 



Example 7 - hSARAI co-localk^ with TftRH 

The positioning of hSARAI upstream of Smad2 activation suggested to us 
that hSARAI might recruit Smad2 to specific subcellular domains for 
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phosphorylation and activation by the receptor. Interestingly, previous studies 
on the TGFp receptor demonstrated clustering of the receptor complex into 
punctate domains that resembled those displayed by hSARAI (Henis et al., 
1994). To test whether hSARAI might colocalize with TGFp receptors, the 
5 subcellular localization of hSARAI and TGFp Mv1 Lu receptors was investigated 
in MvlLu cells. Endogenous TGFp receptors could not be detected, likely due 
to the low numbers of TGFp receptors present on these cells and the even fewer 
number that are activated in the presence of ligand. The localization of hSARAI 
in Mvl Lu cells cotransfected with TpRII and treated with TGFp was therefore 

10 examined. In the absence of hSARAI, TpRII displayed a punctate staining 

pattern similar to the hSARAI pattern (Figure 8A, panels i and ii, respectively), as 
observed previously in COS cells. Furthermore, in cells coexpressing hSARAI 
and TGFp receptors, extensive colocalization of hSARAI and TpRII was 
observed (Figure 8A, panel iii). This colocalization was not complete. This may 

15 be due to a restricted distribution of hSARAI in only a subset of the intracellular 
compartments normally occupied by transmembrane receptors, which include 
the endoplasmic reticulum, Golgi and endocytic pathways. Thus, hSARAI and 
the TGFp receptors colocalize to common subcellular domains. 

The colocalization of hSARAI and the TGFp receptors suggested the 

20 possiblity that hSARAI may interact with the TGFp receptor. To test this, a 

strategy was utilised similar to that employed to characterize the interaction of 
Smad2 with the TGFp receptor (Macfas-Silva et al., 1996). Briefly, COS cells 
were cotransfected with TGFp receptors in the presence of hSARAI and were 
affinity-labelled using [ 125 l]TGFp. hSARAI was then immunoprecipitated from 

25 the cell lysates and coprecipitating receptor complexes were resolved by SDS- 
PAGE and visualized by autoradiography or were quantitated using a gamma 
counter. Analysis of cells expressing wild type receptors type II and type I, 
revealed that receptor complexes coprecipitated with hSARAI (Figure 8B, lane 
3). Furthermore, in the presence of kinase deficient type I receptor, there was a 

30 small increase in binding of hSARAI to the receptor (Figure 8B, lane 2). This is 
, in contrast to Smad2, which only interacts with TGFp receptor complexes that 
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contain kinase deficient type I receptors (Maci'as-Silva et al., 1996). These data 
suggest that hSARAI associates with the TGFp receptor. 

Next examined was whether coexpression of Smad2 might enhance the 
interaction of hSARAI with TGFp receptors. In cells expressing wild type 
5 receptor I, no difference in the amount of receptor complexes that coprecipitated 
with hSARAI, either in the presence or absence of Smad2, was observed (Figure 
8B, compare lanes 3 and 5). In contrast, the association of hSARAI with 
receptor complexes containing kinase-deficient type I receptors was enhanced 
by Smad2 (Figure 8B, lane 4). This finding was consistent with the previous 

1 0 demonstration that kinase-deficient type I receptors stabilize interactions of 

Smad2 with the receptors. To investigate further the requirement for Smad2 in 
the interaction of hSARAI with the receptor, a mutant of hSARAI, SARA(ASBD), 
that removes the Smad binding domain, was tested. Analysis of wild type 
hSARAI interaction with receptor complexes containing kinase-deficient TpRI 

1 5 showed that wild type hSARAI interacted with the receptor and this was 

enhanced approximately two-fold by Smad2 (Figure 9A). The ASBD mutant of 
hSARAI retained the capacity to associate with the receptor, although the 
efficiency of interaction was slightly reduced relative to wild type hSARAI. 
Importantly, unlike wild type hSARAI, binding of mutant hSARAI to the 

20 receptor was not enhanced by coexpression of Smad2. Together, these data 
suggest that hSARAI interacts with the TGFB receptor independently of Smad2 
binding and that Smad2 cooperates to enhance the association. 

To further characterize the domains in SARA that mediate binding to the 
TGFp receptor, the interaction of a panel of SARA mutants with the TGFp 

25 receptor was tested. Interestingly, interaction with the TGFp receptor was 

strongly suppressed in three mutants in which the FYVE domain was disrupted 
(Figure 9B; A594, A664 and the internal deletion A597-665). Since the FYVE 
domain is required for the correct subcellular localization of SARA, it was 
postulated that, once bound to the membrane, other regions in SARA might 

30 contribute to the interaction with the receptor. To examine this possibility, 
.several carboxy-terminal truncation mutants of hSARAI were tested. 
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Interestingly, deletion of the C-terminus downstream of position 750 suppressed 
receptor interaction, despite efficient expression of the truncated protein. This 
suggests that regions in the carboxy-terminus of SARA contribute to receptor 
interaction. In these analyses, the question of whether overexpression of Srnad2 
5 could rescue some interaction of SARA mutants with the receptor was also 
explored. For both the FYVE domain mutants and the C-terminal truncation, 
Smad2 expression was able to restore some interaction with the TGFp receptor. 
It is likely that the high levels of protein and receptor expression that are 
achieved in COS cells can drive some receptor interaction, even in the absence 
10 of appropriate localization signals. 



Example 8 - A modular domain in SARA mediates association with Smads 

To investigate the functional importance of SARA in TGFp signaling, the 
domains in the protein that mediate both its localization to specific subcellular 

1 5 regions and its interaction with Smad2 were defined. To this end, a series of 
deletion mutants of hSARAI were constructed and tested for their ability to 
interact with Smad2 in COS cells by immunoprecipitation followed by 
immunoblotting. As summarized in Figure 10, loss of the first 665 amino acids 
of hSARAI, which included the double zinc finger/FYVE domain, did not 

20 interfere with hSARAI binding to Smad2. However, further deletions (A1-704) 
completely abolished the interaction of Smad2 with hSARAI . To map the 
carboxy-terminal boundary of the Smad binding domain, a number of C-terminal 
truncations were also analyzed. Deletion of all residues downstream of position 
750 did not affect Smad2 interaction with hSARAI, while an additional loss of 

25 85 amino acids (A665-1323) completely abrogated binding to Smad2. To 

determine whether the region defined by this deletional analysis was sufficient to 
bind Smad2, the 85 amino acids referred to as the Smad Binding Domain (SBD) 
were linked to GST and the fusion protein was expressed in bacteria (GST-h 
SARA(665-750)). Incubation of lysates prepared from cells expressing Smad2 or 

30 Smad3 with GST-SBD resulted in efficient binding of both Smads to the fusion 
protein (Figure 11 A). This interaction is likely direct, since bacterially expressed 
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SBD associates efficiently with bacterially-produced Smad2 (data not shown). 
These studies thus define a novel domain in SARA that mediates interaction with 
Smad2 and Smad3 and which is located downstream of the FYVE domain. 

The above-described analysis in COS cells showed that phosphorylation 
5 of Smad2 by the TGFB receptor induced dissociation from SARA. To determine 
whether this reflects an alteration in the ability of the SBD to bind 
phosphorylated Smad2, the interaction of GST-SB D with Smad2 in lysates 
obtained from cells expressing Smad2 alone, or Smad2 together with either wild 
type or activated TGFp type I receptor, was tested. As described previously, 
coexpression of activated type I receptors with the appropriate receptor- 
regulated Smad yields efficient phosphorylation of Smad protein. In lysates from 
cells expressing Smad2 alone or Smad2 with wild type receptors, efficient 
binding of Smad2 to GST-SBD was observed. In contrast, in the presence of 
activated TBRI, the interaction of Smad2 with GST-SBD was strongly reduced 
1 5 (Figure 1 1 B). This reduction correlated with receptor-dependent 

phosphorylation, since the phosphorylation site mutant, Smad2(2SA), interacted 
efficiently with GST-SBD, even in the presence of activated TBRI (data not 
shown). These data strongly support a mechanism whereby SARA interacts with 
unphosphorylated Smad2 and receptor-dependent phosphorylation induces 
20 dissociation by altering the affinity of Smad2 for the SBD. 



10 



Example 9 - The FYVE domain controls th*. su b cellular Inrajization of SARA 

The subcellular localization of a selection of the SARA mutants was 
analysed by immunofluorescence and confocal microscopy. Analysis of 

25 truncation mutants that removed the amino terminus upstream of the FYVE 
domain (A1-531) yielded wild type patterns of staining (Figure 12, compare 
panels i and ii). However, a further deletion (A1-664) that disrupted the FYVE 
domain but did not interfere with the Smad binding domain, abolished the wild 
type staining pattern (Figure 12, panel iii). Similar studies of the C-terminal 

30 domains showed that residues downstream of the FYVE domain (A665-1 323) did 
not alter the localization of the mutant protein (Figure 12, panel iv), while 
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truncations within the FYVE domain (A596-1323) led to diffuse localization 
throughout the cell (Figure 12, panel v). Of note, the A665-1323 mutant lacked 
the Smad binding domain, thereby indicating that interaction with Smad2 is not 
required for proper SARA localization. To confirm that FYVE domain function 
5 was required for localization of SARA, a mutant with a small internal deletion 
that removes the FYVE domain (A597-664) was tested. Consistent with the other 
mutants, localization of this protein was clearly disrupted (Figure 12, panel vi). 
Since none of these mutants interfered with Smad binding, the FYVE domain 
appears to be required to maintain the normal localization of SARA but is not 
10 involved in mediating interactions with Smads. 

Example 10 - SARA-mediated localization of Smad2 is necessary for TGFB 
signaling 

The availability of mutants of hSARAl that interact with Smad2 but fail to 
1 5 target to the appropriate subcellular sites allowed the question of whether 

hSARAl -mediated localization of Smad2 was important to TGFp signaling to be 
addressed. Whether SARA(A1-594) and SARA(A1-664), which bind Smad but 
fail to distribute to the correct subcellular domains, would mislocalize Smad2 
was examined. Coexpression of either mutant with Smad2 showed that they 
20 were unable to recruit Smad2 to the normal SARA domains (Figure 13A, panels i 
and ii). As expected, SARA(Al-704), which lacks a Smad binding domain, was 
unable to control Smad2 localization (Figure 13A, panel iii). Whether these 
mutants could cause mislocalization of Smad2 was also examined. For this, 
cells were cotransfected with wild type hSARAl and Smad2 either in the 

25 absence or presence of SARA(Al-594), SARA(A1-664) or SARA(A1-704). In 

control transfectants, performed in the absence of mutant hSARAl, hSARAl and 
Smad2 were colocalized in punctate domains as described above (Figure 13B, 
panel i). However, in the presence of either SARA(A1-594) or SARA(A1-664), 
the localization of wild type hSARAl was normal, but the distribution of Smad2 

30 was clearly disrupted and displayed a diffuse pattern (Figure 13B, panels ii and 
iii, respectively). Moreover, coexpression of SARA(A1-704), which does not 
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bind Smad2, resulted in Smad2 distribution that was indistinguishable from that 
of the wild type pattern (Figure 13B, panel iv). Thus, SARA(A1-594) and 
SARA(A1-664) induce the mislocalization of Smad2. 

Since SARA(A1-664) mislocalizes Smads and interferes with receptor 
5 association, we investigated whether this mutant would disrupt TGFfJ signaling. 
To test this, we transiently transfected the TGFp-responsive reporter gene 3TP- 
lux into Mvl Lu cells in the presence and absence of wild type or mutant 
versions of hSARAI . Expression of wild type hSARAI had no effect on TGFp 
signaling (Figure 14). In contrast, transfection of SARA(A1-664) significantly 

1 0 inhibited TGFp-dependent signaling at the lowest concentration of DNA tested, 
while transfection of higher doses completely abolished responsiveness of the 
cells. We also tested SARA(Al-704) which lacks a functional Smad binding 
domain and does not alter Smad2 localization. Transfection of this mutant had 
no effect on TGFp signaling (Figure 14). In addition to analysis of the 3TP 

1 5 promoter, we examined induction of the activin response element (ARE) from 
the Xenopus Mix.2 gene in HepG2 cells. 

This ARE is stimulated by either TGFp or activin signaling, which induces 
assembly of a DNA binding complex that is composed of Smad2, Smad4 and a 
member of the FAST family of forkhead DNA binding proteins. Since HepG2 

20 cells do not possess endogenous FAST activity, wild type or mutants of hSARAI 
were cotransfected with FAST2 and the ARE-lux reporter plasmid as described 
previously (Labbe et al., 1998). Expression of either SARA0-A594) or SARA0- 
A664), which interfere with or delete the FYVE domain, respectively, resulted in 
a strong suppression of TGF0-dependent induction of the ARE (Figure 15). 

25 However, none of the other mutants tested suppressed activation of this 

promoter. Since none of these latter mutants disturb the localization of hSARAI - 
Smad2 complexes, these data strongly suggest that recruitment of Smad2 to the 
receptor-containing subcellular domains is important for TGFp signaling. 
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Example 1 1 - Tissue di stribution of hSARA ex P r«»«inn 



WO 00/05360 




PCT/CA99/00656 



43 



The 3'UTR of hSARAI and a Smad2 cDNA fragment were used to probe 
a human multiple tissue Northern blot (Clontech). The results are shown in 
Figure 16 - hSARAI : upper panel and Smad2: lower panel. hSARAI and Smad2 
were ubiquitously expressed in the tissues examined; relatively low levels of 
5 hSARAI were selected in liver. hSARAI and Smad2 showed a similar 
expression pattern except in placenta, where proportionally more Smad2 
message was observed. A single transcript of 5.0 kb is seen, corresponding to 
the full length hSARAI cDNA. 

SARA expression was examined in a variety of cell lines using RT-PCR 
10 analysis and the gene was found to be expressed in every cell line tested. These 
included HepG2 hepatoma cells, NBFL neuroblastoma cells, SW480 colorectal 
cancer cells, N1H 3T3 fibroblasts, P19 embryonic carcinoma cells, MC3T3 
calvarial cells and Mv1 Lu lung epithelial cells (data not shown). hSARAI 
appears to be a ubiquitously expressed partner for Smad2 and Smad3. 

15 

Example 12 - Interaction of endogenous hSARAI and Smad2 in mammalian 
cells 

Lysates from HepG2 cells, either untreated or treated with InM TGFp, 
were immunoprecipitated with an affinity-purified, anti-hSARAl rabbit 

20 polyclonal antibody and the immunoprecipitates were immunoblotted with a 
polyclonal, anti-Smad2 antibody (Macias-Silva et al., 1998). Controls were 
immunoprecipitated with pre-immune sera or N19 anti-Smad2/3 antibody. The 
results are shown in Figure 1 7. In immunoprecipitates prepared with 
preimmune antisera, no Smad2 was detectable. Anti-hSARA1 

25 immunoprecipitates clearly showed Smad2 co-precipitating with hSARA! . TGFp 
treatment prior to lysis gave decreased association of Smad2 and SARA. 

These results demonstrate that SARA is a specific partner of receptor 
regulated Smads in the TGFp/activin signaling pathway and further suggest that 
TGFp signaling induces dissociation of SARA/Smad complexes. 
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The present invention is not limited to the features of the embodiments 
described herein, but includes all variations and modifications within the scope 
of the claims. 
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TABLE 1 - hSAR Al - Sacruence IS wq.i 



GTGTTAGATGAATTTGAACAAAACGAAGATGAAAC^GTTTCTTCTACTTTA^r^rn 
CAAAGTGGAATAAGATTCTAGATCCCCCTTCTCACCGGCTGTCATTTAACCCT 

^ GGA Z^ GG ^ GTATAGGTA ^^CCCCTCCATGTCTGCGATTACAAGT^^ 
JI G ^™ TAAT ^ CATCC ^ GGGAA ^ GATGG ATGTCCTGCTGT^ 
?^^ A * A ? AC ^ GATGAGGACCTCACTG ^ 
^S?^F CC ^ CCCACATG ^ 




™^ G ^ AATGGGTCTGG ^ GG AATAATGACTGTGAACGGTGTTCAGATTGCOT 

S GC ™ AA * GAA i™ GGGCTGATGAA ^^ 

A ? G Z GA ^ C TCCACTGTTATAGACACACCAGCAGCAAATTATCTATC 



G ' G JE G J G JE^ G F AG , ATGGGTA ^ 

AGG J2™^ CACCAAAAGGAGG ^ 

^S^i G i AGCCTGAAATGTAAACTGTOTA ^TGGACAGAA^ 
S G ZSI™F GCCATTCAGTGCTA ATC^TGCTCAAGCCTGGGAGAA^ 

GG ^^^^^^^^ G ^^^ GG ^^^GGAGCTCTGAGCTCTCCACCTCCCACTGTGATGGT 

5?aS™S G ™^ C F oATGGGATCTTGCC ^ TGGAGAAG TTGCTGATGCAGCCAAAT 
I AACAA ^ GAA ^ GGAACTTCCT -TGC^GGAACCCTGGCTGTGTCACACGACCCA 




ScSSSSSS^™ ^^^^c^'^^ctgagatagtcattct 

- CAG . i_AAAG . , GGCAGTAAAGAACATGGTGGATTCTTATATGTGACATCTAC 
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TABLE 1 - hSARAl Continued 

CTACCAGTCACTGCAAGACCTAGTACTCCCaACCCCACCTTACTTGTTTGGGATTCTT 
ATCCAGAAATGGGAAACTCCTTGGGCTAAAGTATTTCCTATCCGTCTGATGTTGAGAC 
TTGGAGCTGAATATCGACTTTATCCATGCCCACTATTCAGTGTCAGATTTCGGAAGCC 
ATTGTTTGGAGAGACGGGGCATACCATCATGAAT C TTCTTGCAGACTTCAGAAATTAC 
CAGTATACCTTGCCAGTAGTTCAAGGTTTGGTGGTTGATATGGAAGTTCGGAAAACTA 
G CATCAAAATTCCCAGCAACAGATACAATG AGATGATG AAAG C CATG AAC AAG TC C AA 
TGAGCATGTCCTGGCAGGAGGTGCCTGCTTCAATGAAAAGGCAGACTCTCATCTTGTG 
TGTGTACAGAATGATGATGGAAACTATCAGACCCAGGCTATCAGTATTCACAATCAGC 
CCAGAAAAGTGACTGGTGCCAGTTTCTTTGTGTTCAGTGGCGCTCTGAAATCCTCTTC 
TGGATACCTTGCCAAGTCCAGTATTGTGGAAGATGGTGTTATGGTCCAGATTACTGCA 
GAGAACATGGATTCCTTGAGGCAGGCACTGCGAGAGATGAAGGACTTCACCATCACCT 
GTGGGAAGGCGGACGCGGAGGAACCCCAGGAGCACATCCACATCCAGTGGGTGGATGA 
TGACAAGAACGTTAGCAAGGGTGTCGTAAGTCCTATAGATGGGAAGTCCATGGAGACT 
ATAACAAATGTGAAGATATTCCATGGATCAGAATATAAAGCAAATGGAAAAGTAATCA 
GATGGACAGAGGTGTTTTTCCTAGAAAACGATGACCAGCACAATTGCCTCAGTGATCC 
TGCAGATCACAGTAGATTGACTGAGCATGTTGCCAAAG C TTTT T GCCTTGCTCTCTGT 
CCTCACCTGAAACTTCTGAAGGAAGATGGAATGACCAAACTGGGACTACGTGTGACAC 
TTGACTCAGATCAGGTTGGCTATCAAGCAGGGAGCAATGGCCAGCCCCTTCCCTCGCA 
GTACATGAATGATCTGGATAGCGCCTTGGTGCCGGTGATCCATGGAGGGGCCTGCCAG 
CTTAGTGAGGGCCCCGTTGTCATGGAACTCATCTTTTATATTCTGGAAAACATCGTAT 
AAACAGAGAAGACTTCATTTTTTTCTGTTCAGACTTGTTGCAACAGCAGTCATACCCA 
AATCATTTGCACTTTAAAACTGGAAGATTAAGCTTTTGTTAACACTATTAATGGGGTG 
GGGAATAGGGTGGGAGTGGGGGTTTGGGAGACGGGTGGGAAAGGGTGGTTGGGGGGAC 
CGATGTTCCATAATTCTAAGTCTTCTATGCATTGTCCACCAAGAAGATCTGGGCAGCT 
TCTGTTCCTGCACAACAGTTATGCTATCCTTGCAGCTAATCCCCTTCTGTTACTGTTT 
AGACAAGAATTCCGCTCCTCTCTCAAGATTTACTTATGGTCATGTGCTCAGAAATGCT 
CAAATGGGTACAACCATCACCAAGGGTGGGATGGGAGGGCAGAGGGGAAATAAAATAT 
AAAGCATCAAAAAAAAAAAAAAAAA 
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TABLg 2 - hSAH Al - Sequence ID NOt2 

TLOTDLQDCNlTYlISQSLMDAf'SCSLDllENRQTDOFSFSINESTEKDMNSEKQMDPLNR 
S^™}[1?F CPTSSDS ^ VCSPS0LKDDGSIGRD PSMSAITSLTVDSVISS0GTD 
°E PA Y^^ IPDEDLTGKISSPRTO LGSPNSFSHMSEGILMKKEPAEESTTEESLR 
^ G ^ P ^ l ^ 4 ^EMPNGSGRNiroCERCSDCLVPNEVRADENEGYEHE£TIiGTT£FLNMTEH 
™f52 rm3 ^ TK ^™ SQ ^ E ^ CTL QISOPEDTNGDSGGQCVGLADAGLDLK 
GTCISESEECDFSTVIDTPAANYLSNGCDSYGMQDPGVSFVPKTLPSKEDSVTEEKEI 
EESKSECySNIYEQRGNEATEGSGLLLNSTGDLMKKNYLHNFCSQVPSVLGQSSPKW 

™^^ CSPSLGNIS ^™ GEmESYEAEIS TRPCLALAPDSPDNDLRAGQFGISA 

™SSE^ V ^ IraSVI ^ A0AWE1 ^ S ^ s O SPNP ^PAEYCSTIPPLQ0AQASGA^ 
f™2^^ V ^ rapGA ^ AQPMQ ^ WFMGILPNGE VADAA^ 

ss™^ tosplpaeroi ^ sgsit q vgspvgs amnlip e °glppilistgvk 
g ™^ e ^sqisvmqqledggpdplvfvi^a^^ 

X G 2f™5^ QCLPDEKCLPraiFraFVQL ^ A ^ GNVV SNLGHSFFSQSFLGSKEH 
GGF ?J^ S ^ 0SL Q DL ^ p T pp ^FGILIQK^ 

™Y5£P^ LFGETGOTIMNL ^ rROT QmpwQGL 

MMKAM^NEHVIAGGACFNEKADSHLVCVQNDDGNYQTOAISIHNQPR 
„^^ SSGYLJ ^ SI ^ DG ^^QITAENm^ 

SJ™2^ DD ^ SKGWSPIDG ^^ TI1 ^ IFHG SEyKANGIOriRWTEVFFLEiro 
°S^S^ SDPADHS ^ TEHVA ^ C ^ CT Q L ^KGDGMTKiGLRVTLDSDQVGyOAG 
SNGQHLPSQYMNDFDSDLVKMIHGGACQLSEGPWMELIFYILENIV 
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TABLE 3 - human SARA 2 - Sequence ID NO; 3 

ACTCCCGGCCGGGGTAGCTCTTCACTCCTCAGCGCGACGTCGTGTCGAGTTCCZAAAA. 

AGCTCCGCAGGGGCTGTAGGGAGGTGATCTCATCCATTAACAGCTGTGTGTTGCCAG'T 

TCCCAAATCTTTATCTATCTCAGACTTCTCTCCTGCATTCCAGATTCTTATATTCAGC 

TGCCTTTTGGATATCTCTCCCAGGATGTTCTCAAGGCATACAAGAATTAAATTwTGAA 

TAAGTCTGCAGGTAGGATGGACAGTTATTTTAAAGCAGCTGTCAGTGACTTGGACAAA 

CTCCTTGATG A TTTTGAACAGAACCCAGATGAACAAGATTATCTCGCAGATGTACAAA 

ATGCATATGATTCTAACCACTGCTCAGTTTCTTCAGAGTTGGCTTCCTCACAGCGAAC 

TTCATTGCTCCCAAAAGACCAAGAGTGCGTTAATAGTTGTGCCTCATCAGAAACAAGC 

TATGGAACAAATGAGAGTTCCCTGAATGAAAAAACACTCAAGGGACTTACTTCTATAC 

AAAATGAAAAAAATGTAACAGGACTTGATCTTCTTTCTTCTGTGGATGGTGGTACTTC 

AGATGAAATCCAGCCGTTATATATGGGACGATGTAGTAAACCTATCTGTGATCTGATA 

AGTGACATGGGTAACTTAGTTCATGCAACCAATAGTGAAGAAGATATTAAAAAATTAT 

TGCCAGATGATTTTAAGTCTAATGCAGATTCCTTGATTGGATTGGATTTATCTTCAGT 

GTCAGATACTCCCTGTGTTTCTTCAACAGACCATGATAGTGATACTGTCAGAGAACAA 

CAGAATGATATCAGTTCTGAATTACAAAATAGAGAAATCGGAGGAATCAAAGAATTGG 

GTATAAAAGTAGATACAACAGTTTCAGATTCCTATAATTACAGTGGAACAGAAAATTT 

AAAAGATAAAAAGATCTTTAATCAGTTAGAATCAATTGTTGATTTTAACATGTCATCT 

G CTTTGACTCGACAAAGTTC CAAAATGTTT CATG CCAAAGACAAG CTACAAC ACAAG A 

GCCAGCCATGTGGATTACTAAAAGATGTTGGCTTAGTAAAAGAGGAAGTAGATGTGGC 

AGTCATAACTGCCGCAGAATGTTTAAAAGAAGAGGGCAAGACAAGTGCTTTGACCTGC 

AGCCTTCCGAAAAATGAAGATTTATGC7TAAATGATTCAAATTCAAGAGATGAAAATT 

TCAAATTACCTGACTTTTCCTTTCAGGAAGATAAGACTGTTATAAAACAATCTGCACA 

AGAAGACTCAAAAAGTTTAGACCTTAAGGATAATGATGTAATCCAAGATTCCTCTTCA 

GCTTTACATGTTTCCAGTAAAGATGTGCCGTCCTCATTGTCCTGTCTTCCTGCGTCTG 

GGTCTATGTGTGGATCATTAATTGAAAGTAAAGCACGGGGTGATTTTTTACCTCAGCA 

TGAACATAAAGATAATATACAAGATGCAGTGACTATACATGAAGAAATACAGAACAGT 

GTTGTTCTAGGTGGGGAACCATTCAAAGAGAATGATCTTTTGAAACAGGAAAAATGTA 

AAAG CATACTCCTTCAGTCATTAATTG AAGGGATGGAAGACAGAAAG AT AG AT C CTG A 

CCAGACAGTAATCAGAGCTGAGTCTTTGGATGGTGGTGACACCAGTTCTACAGTTGTA 

GAATCTCAAGAGGGGCTTTCTGGCACTCATGTCCCAGAGTCTTCTGATTGTTGTGAAG 

GTTTTATTAATACTTTTTCAAGCAATGATATGGATGGGCAAGACTTAGATTACTTTAA 

TATTGATGAAGGCGCAAAAAGTGGCCCACTAATTAGTGATGCTGAACTTGATGCCTTT 

CTGACAGAACAGTATCTTCAGACCACTAACATAAAGTCTTTTGAAGAAAATGTAAATG 

ACTCTAAATCGCAAATGAATCAGATAGATATGAAAGGCTTAGATGATGGAAACATCAA 

TAATATATATTTCAATGCAGAAGCAGGAGCTATTGGGGAAAGTCATGGTATTAATATA 

ATTTGTGAAACAGTTGATAAACAAAATACAATAGAAAATGGCCTTTCTTTAGGAGAAA 

AAAGCACTATTCCAGTTCAACAAGGGTTACCTACCAGTAAGTCTGAGATTACAAATCA 

ATTATCAGTCTCTGATATTAACAGTCAATCTGTTGGAGGGGCCAGACCTAAGCAATTG 

TTTAGCCTTCCATCAAGAACAAGGAGTTCAAAGGACCTGAATAAGCCAGATGTTCCAG 

ATACAATAGAAAGTGAACCCAGCACAGCAGATACCGTTGTTCCAATCACTTGTGCTAT 

AGATTCTAGAGCTGATCCACAGGTTAGCTTCAACTCTAATTACATTGATATAGAAAGT 

AATTCTGAAGGTGGATCTAGTTTCGTAACTGCAAATGAAGATTCTGTACCTGAAAACA 

CTTG CAAAG AAGGCTTGGTTTTGGG C C AG AAACAGC CTACTTG GGTT C C T G ATT CAG A 

AGCTCCAAACTGTATGAACTGCCAAGTCAAATTTACTTTTACCAAACGGCGACACCAT 

TGCCGAGCATGTGGGAAAGTATTTTGTGGTGTCTGTTGTAATAGGAAGTGTAAACTGC 

AATATCTAGAAAAGGAAGCAAGAGTATGTGTAGTCTGCTATGAAACTATTAGTAAAGC 

TCAGGCATTTGAAAGGATGATGAGTC CAACTGGTTCTAATCTTAAGTCTAAT CATTCT 

GATGAATGTACTACTGTCCAGCCTCCTCAGGAGAACCAAACATCCAGTATACZTTCAC 

CAGCAACTTTGCCAGTCTCAGCACTTAAACAACCAGGTGTTGAAGGACTATGTTCCAA 

AGAACAGAAGAGAGTATGGTTTGCAGATGGTATATTGCCCAATGGTGAAG7TGCAGAT 

ACAACAAAATTATCATCTGGAAGTAAAAGATGTTCTGAAGACTTTAGTCTT2TTTCAC 

CTGATGTGCCTATGACAGTAAACACAGTGGATCATTCCCATTCTACTACAGTGGAAAA 

G CGAAACAATG AGACAGG AG ATATTACAAG AAATG AG ATAATT CAG AG T Z GTATTTCT 

C AGG TT C CATCAGTGGAAAAATT G T CTATG AACACAG G AAATG AG G G G TT A C ZTACTT 
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TABLE 3 - human S ARA2 - Continued 

^tEZE^Z^^ ?HZ^?TiF^ A ^ AJ *^^*^ N ^^^'^ CTATTG CTAGTATTTCAGATTATAG GTTA 
™^^^^™ G ^ TGTCTG ^ T ^ G -TTAGTCTTCTACCTAATGATGAGG 
ACAGTTTGCCCCCACTTCTGGTTGCATCTGGAGAAAAGGGATCAGTGCCTGTAGTAGA 
JS^^ TCCATCT ^ T ^ G ^^ TttT ^GCTTCTTGAAGGTGAAGGCTTTCATCC- 



~ ~ ~ — A^iuLii^A^ ^*iM\»V* i. OAiiU^C i T * CATC C 

S3FH^ ^^ G ^^^ AAA E G ^?^^^^^^^ G ^ G ^^ TG ^CA^^TTCATATTTTATTCC~ 
C ^ A 5 AAA I A ]I G j^ 




™II^ CmACTGAGAGTOTCT » G TAGCAAGGATCACGGAGGATTCCTGTTTAT 

TACACCTACTTTTCAGAAACTTGATC^TCra 

S^^TTATCCAGAAGCTTGAGAT^^^ 

TGTTGAGATTGGGTGCAGAATATAAAGCATATCCTGCTCCTCTAACAAGCATCA.GAG" 
^^?J A ^ GTATACOTG ^ TAATATAGATCAAG TGTTGATTC^^^ 
^SIE CCAATGAGCATGT ^rcAGCATTGGAG 
G £^£ CTAGAAAAGTGA » GG TGCAAGTm^ 

ACTCCAGAGACCATGAATGGCTTGCGGCTAGCTTTACGAGAACAGAtAS^ 
^^^^^*^ AAAG ^ GA ^ G ^ G ^ AGAG ^ GAGAGAA ^ GGTGG ^^^ GTGG TGGGT 

i? A I2E G i^S AAAAAGGAAACAAAG ^ 
^ G , GA ^ CCAAGTGAAAAAATA ^ 

TTGTAAAATGTACCGAGGTGTTCTACTTTCTAAAGGACCAGGATTTATCTATTTTATC 

AACTTCTTATCAGTTTGCAAAAGAAATAGCCATGGCTTGTAGTGCTGCGCTGTGCCCT 

CACCTGAAAACTCTAAAAAGTAATGGGATGAATAAAATTGGACTCAGAGTTTCCATTG 
ACACTGATATGGTTGAA.TTTCAGGCAGGATCTG Alftcr n a rTTrr^r-PTr^ rn» 




~ Z^-~~ZV ii.ivaiiiii.Liijiwii v-i-ATGGTGGGACCTCCAACTCT 

AGpTACCATTAGAAATAGAATTAGTGTTTTTCATTATAGAAC^TCTTTTTTAGTGAA 
AG 4 A ? G I^E ATATOCATA ^ GCAACCT ^ TT - G ^AAAACTAACT 
G £I G ^^ TG ^^ CAAACACTAAAA ^^ 

A ^^ AA 5 AGATC ^ TACmCTGTAGG TTTACAATTAATTTAAGTACT 

S£f G £E^ c ^ TAAGAA ^^ 

™ A * C I GCAC ^ AGGTATACCTC1 ^ A ^^ 

E E A ^^E ?^ A ^ GG "^ A ^ GAAGGAAA ^ A ^ G ^ G CTTTGTGTATATG CCAGTTAGAATACTG 
G ?^J AAAGTCTGTCAAA ^ GTATOC AGTGGCACAAAAACCAGTTTT 
G ^ G ^E A ^ AA ^3^^^^ GAA ^ AAAA ^^ GA ^ AACT ~ A TTTGTATAATTGGAGTGGAGACCr 
«ES^^^^^^ GA ^ AAAG ^ G ^^^^ GGA ^ A ^ AA TCAGAATTTTGCCTTTTTTCT ,T ' 

ATGGATATTAAAATAATTGCGGGTGCTTCAGGACTTTTTGCTTCTATATTTAAGTATA 

TAGGxAAT^TTrTTGTATCACAATGCATTATTTrTrTTCCTCCTTTCCTTCCAAACT* 

TAGTCAT * -".ToTAGCAATAAATGAAGCCTGAAACAGGTTTTTTTACTTCCACTrTAAT 
C ^ AGAAA P r - rrGGCAAC ^ GGG ATATTrTCATTGACACTGGTGTATAAaTATAt 

^^^^^^^^ ~^^' A /?3'C^? A /^E" A ^' G '^ > ^'^ AAT ^ ^^«' I '«~«-~~TGAAA.TTGTTAC AG AGCjTTT 

?5r-^~~:-^ CAGCAA ^^ 
I A ^ AA ^?::: AA ?2^ A ^ 

S52££?- rrCC^E^ JE ATAT ATTrCAAGAAAAAGAAATTATGTTAAATAGCCCr 

??HI AAG ^^^™H: A ^ GAAGCATCT=A ^ 

ISSf G cir _™->x: E AA ^ gagaga "~^" g, ^^^^ aa ~ ga ^^~^gcata7 

AG "^ir^-l™irC^:E T5TAGTACTTAGCZ ™ GACAAT — 3SAGGAGAATATZ 
— ^-*«-TTCCTGAACAATGTAAAGrGTCGCAGATATTCAATAAA 
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TABLE 3 - Human SARA2 Continued 



ATGGCJUVCCTGTTATAATTTGTGAAATTTATTGAAA.TGGTGTAAGATGAAAACAATTG 
CATATCAAACCCAATTTATGTTTTCTAAATATAGTGTATGTATTCTGCCATGTAAGTA 
ATTGAACAGTCTTAAAATAACCAAATGGTAGAGGGCTGTTCCATGATGGGACAGCTTT 
GGATTTGTTTTCATAAAATCTCTACATTCAATAAAAATTGGAATTATGTGCCTGAAGT 
TTGGAGGCAC A TTT T GAAGT 
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TABLE 4 - hmnan SARA2 - Semienea ID WO 1 4 

MDS YFKAAVSDLDKLLDDFEQNPDEODYLCDVONAYDSNHCSVSSELASS OET ~ LL?K 
pOEC^S^SETSYGTNESSLOTKTLKGLTSIQNEKNVTGLDLLSSVDGGTS-^IOP 
LYMGRCSKPICDLISDMGNLVHATNSEEDIKKLLPDDFKSNADSLIGLDLSSVSDTPC 
^ T ^™f°I^ Q0OTTSSEL Q^ IG =I^GIKVDTTLSDSYNySGTENL:^KKI 
FNQLESIVDFNMSSALTROSSKMFHAKDKLOHKSQPCGLLKDVGLVKEEVDVAVIT-AA 

IQDAVTIHEEIQHSVVLGGEPFKENDLLKQEKCKSILLQSLIEGMEDRKIDPDOTVIR 

^?^^®^^^^^'^Q^^ i ®^^^2SSDCCEGFIlJTFSSNDMDGODLD'YFN'IDEGA 

KSGPLISDAEL2DAFLTEQYLQTTNIKSFEENVNDSKSQMNQIDMKGLDDGNINNIYFN 

AEAGAIGESHGINIICETVDKQNTIENGLSLGEKSTIPVQQGLPTSKSEITNOLSVSD 

INSQSVGGARPKQLFSLPSRTRSSKDLNKPDVPDTIESEPSTADTWPITCAIDSTAD 

PQVSFNSNYIDIESNSEGGSSFVTANEDSVPEirrCKEGLVLGQKQPTWPDSEAPNCM 
NCQVKFTFTIOUUmCRACGKVFC^ 

MMS PTGSNLKSNHSDECTTVQPPQENQTS S I PS PATLPVSALKQPGVEGLCSKEQKRV 

WFADGILPNGEVADTTlOiSSGSKRCSEDFSPLSPDVPMTVin^HSHSTTVEKPl^T 
™ITOTXIQSFISQVPSVra*MTOHEra^ 

NSNLP IAS ISDYRLLCDINKYVC3JKISLLPNDEDSLPPLLVASGEKGSVPWE - - «SH 

EQ^IiLEGEGFHPVTFVLNANIiVinnCFIFYSSDKrwyFSTHGIJIGLGOAE 

CLPNEDTIPKDIFRLFITIYKDALKGKYIENLDNITFTESFLSSKDHGGFLFttb TFQ 

KLDDLS LP SNP FLCG IL I OKLEI PWAKVF PMRLMLRLGAE YKAYP APLTS I RGRKPLF 

GEIGHTIMNI^VDLRNYQYTLHNIDQI^IHMEMGKSCIKIPRKKYSDVMKVLNSSWEH 

VIS IGASFSTEADSHLVCIQNDGIYETQANS ATGHPRKVTGASFWFNGALKTS SGFL 

AKSSIVEDGLMVQITPETMNGLRIALREQKDFKITCGKVDAVDLREYVDICWVDAEEK 

GNKGVISSVDGISLQGFPSEKIKLEADFETDEKIVKCTEVFYFLKDQDLSILSTSYQF 

AKEIAMACSAALCPHLKTLKSNGMNKIGLRVSIDTDMVEFQAGSEGQLLPQHYLNDLD 

SALIPVIHGGTSNSSLPLEIELVFFIIEHLF wuu*uo.~±muuu 
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TABLE 5 - XSARX1 - Sequence TP NO s 5 

C T G TAAGTTTG ACTATGTAGGAAAG CATTTCTGTT AT CTATG AAGTATGTTTTAG AG T 
CAGACCAATAACTAAACGG i 1 TTUTTTTTTTT GTTTATTTZCCCTCAGATGAGACTGT 
CTCTCCAAAGCTATTAGATGCTAAGTGGAATCAAATCTTAGAACCGCATTCACATAAA- 
GTCGCTGATAACTCCGCCCTTGACAATGTCTGTAAATCAATCATTGCTATTGAAGCTC 
ATCTCAAAGTCAGGTCACCCGGCTTGTCAGCCCTTGTGAGGTCCACATATGTGAATGG 
AGAAGTAGGTATTGTGGCACCTGAAATGCCCAAAATGGTGATAGGAGACACCATTATG 
GCAGAGGATTCACTTTTTAACAACACTGGTCCCTCTGAAATTGTATGCAACCCATCTA 
CTGTGGAGAGTCAAAGTTTACAAGCTTTAGATGATCAATCAGTGAATATTCACAATGA 
AAAAAGTGTTCTGCTCGCTGATGGCTTTTCACCATGCAGTAGCCCCAAAAGTATTATA 
AACTTTGACTGCTTGACCATGGATAACGAAATGCCTTTGCACAGTCAAATGAGTGTTG 
ATGACAATGACAAAGAAACTGTAACAATTTCAGTCCTTCCAACAATCATACAGGATAC 
TAGTAACGTAAGCACAGACCCAGCTATCAATAAACCTGGCACTAAAGAACCCCATAGA 
GCATTAAAGGAAACCACATCAGTTATTCTGCCTGAAATAAAGCCTTACTCCACATGTG 
CTGCCCTTTCGTTTGAAAATAACAATAAGGTTCCCAGTTATCAATTAAATAATACAGA 
TCTACTCAGCGTTTCACCAGTGGTTGAAGCATGTAGTGAGCAGCAGCAAAAACATACA 
TCTTCCTTGCATGAAGAAAAACTTTTTGAAGGTGTTTCTGCAACGGAGTCCTTTGCAG 
CCACTGCTGCGGAAACTGTACTGGATAATGAGGCTCTCCGTAGTGCTGAATTCTTTGA 
CATTGTTGTAAAGAACTTTTCTGACTCTTGTGTGATTAATGGCGACTTGACTAAAAGT 
TGTGGCCTCTCTCAAGAAAGCAATGAAAAGTTTTGTGCAAGTAAAGAGTTTGAAGGAG 
GGGTAGATGCTAATGTCTTGTTGGAAAATGCATGTGTAGCTTATAAAGAAGCAATAGA 
TTTGCCTGAAGAAAATGGAACTAATGCACCAATGTCTCTGTACAATGGGTGTGATTCC 
TATGGAATGAAAAACCCAGCCGTAGCTCAAAACCCAAAGAATTTACCTTCAAAAGAAG 
ATTCTGTGACAGAAGAAAAAGAAATTGAAGAAAGCAAGTCAGAATACTATACTGGTGT 
TTATGAACAACAAAGAGAAGATGATGTTACAGAGAGAGGTGGACTTCTGTTAAATGCT 
AAGGCTGACCAAATGAAGAACAATTTGCATAGTCTTTGTAATCAGGTTCCATCCATGC 
ATGGGCAAACATCACCAAAAAAGGGCAAGATTGTGCAATCTCTCAGTGTTCCATACGG 
TGGAGCACGCACTAAGCAGCCAACTCATCTCAAACTCCATATTCCAAAGCCATTGTCT 
GAAATGTTGCAGAGCGATCTCATTCCTCCAAATGCTGGCTGCAGCTCTAAATACAAAA 
ATGACATGTTAAACAAATCAAATCAGGGGGATAACCTGATTTCAGAATCACTGCGTGA 
GGATTCTGCAGTGCGCAGCCCTGTTACTGATGCTAATGGTGATTTCCCTGGAGAATAC 
AGGGGACCTGGCAGCTTGTGCCTTGCAGTGTCTCCAGACAGCCCAGACAACGATCTGC 
TTGCCGGGCAGTTTGGGGTACCCATCTCTAAGCCATTTACTACTCTAGGGGAAGTGGC 
TCCAGTCTGGGTGCCAGATTCCCAAGCACCAAACTGCATGAAGTGCGAGGCCAGATTT 
ACATTTACCAAAAGGAGGCATCACTGCCGAGCTTGTGGAAAGGTGTTCTGTGCTGCTT 
GTTGCAGTCTAAAATGCAAACTACAGTACATGGATAAAAAGGAGGCTCGTGTGTGTGT 
TATTTGTCATTCTGTGCTTATGAATGCTCAAGCATGGGAGAACATGTTAAGTGCATCG 
GTCCAAAGCCCAAATCCAAATAATCCTGCTGAATACTGCTCAACTATCCCTCCGATGC 
AGCAGGCACAAGCTTCAGGAGCACTGAGTTCCCCACCTCCCACTGTCATGGTGCCAGT 
GGGTGTGTTAAAACATCCAGGAACTGAAGGGTCACAGTCAAAGGAACAGCGCCGTGTT 
TGGTTTGCTGATGGAATATTACCCAACGGAGAGACTGCTGACTCAGATAATGCAAACG 
TAACTACAGTGGCTGGGACACTTACTGTGTCACATACCAACAATTCCACATCTTCAGA 
GTCTGAGAACACCTCTGGATTCTGTGGAAGTATAACTCAGGTTGGCAGTGCAATGAAC 
CTTATTCCAGAAGATGGGCTTCCTCCTATACTAATCTCTACTGGAGTAAAAGGAGATT 
ACGCAGTTGAGGAACGCCCTTCCCAGATGTCTGTGATGCAGCAACTAGAGGAAGGAGG 
ACCAGATCCTTTGGTTTTTGTTCTAAATGCAAATCTTTTGGCCATGGTTAAGATCGTG 
AACTATGTTAACAGGAAATGCTGGTGCTTTACTACAAAGGGAATGCATGCAGTGGGCC 
AGGCTGAGATCGTAATCCTGTTGCAGTGCCTGCCTGATGAGAAGTGCCTGCCGAGGGA 
CCTGTTTAGCCATTTTGTTGAGCTGTATCAGGAGGCAATTGCAGGTAATGTAGTGGGG 
AACCTGGGGCATTCZTTCCTGAGCCAGAGTTTCCTGGGTAGTAAGGATCATGGTGGAT 
TTCTTTATGTTGCACCAACCTACCAGTCCCTCCAGGACCTGGTTCTTCCTGCAGAGCC 
GTACTTGTTTGGAATCCTTATTCAAAAGTGGGAGACTCCATGGGCCAAAGTGTTCCCC 
ATTCGGCTTATGCTGCGTTTAGGTGCAGAATACAGATTGTACCCATGTCCACTCTTCA 
GTGTTCGATACAGAAAACCTGTGTTTGGGGAAACCGGACACACCATCATTAATGTTCT 
AGCCGATTTCAGAAACTATCAGTATACTCTGCCAGTGGTGCAGGGCTTGGTGGTGGAT 
ATGGAAGTCAG AAAAACT AG CATTAAAATC C C CAG C AAT n-jn, x AATG^-wuATGATGA 
AAGCAATGAACAAAT Z GAATGAGCATGTGTTGGCCA' CAGAT 
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TABLE S - XSAttAl Contlngad 

GGCAC^CTCTCACCTTGTGTGTGTGCAAAACGATGATGGCAATTACCAGAC' crra 

ATTAGTATCC^CAAAC^CCACGTAAAGTGACOT^ 

gtgcactaaagtcttcttccggatacctggccaaatc^ca^gS 



^E^^^^^^^^^^^^^^^^^^^^^^^^^^^^caagaccagcctaagcc^tcagtc 

ataccaatc^cttccouuvccacaagc^tgaatccaacgtt^ 
ctgctgtgaaagggacagcaatattattot^ 

tctgtgtctctcatgggggatttgttgcctgatgggSgSt^agIgISgS??c 
cacccgtgtgtaatatacagtctaagtgtatggtct 

GTGCAA^C^CTGACTC^^ 
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TAB LB 6 - XSARA1 - Sequence XD NO»6 
MPror^IGDTIMAEDSLFNOTGPSEIVOTPSTVESOSLOALDD^ 

FSPCSSPKSIINFDCLTMDNEMPLHSQMSVDDNDKETVTISVLPTIIODTSNVSTDPA 

INKPGTKEPHRALKETTSVTLPEIKPYSTCA^ 

EACSEQQQIOITSSLHEEICLFEGVSATESFAATAAETVLDNE^ 

SCVINGDLTKSCGLSQESNEKFCASKEFEGGVDANVLLENACVAYKEAIDLPEENGTN 
APMSLYNGCDSYGMKNPAVAONPKNLPSKEDSVTEEKEIEESKSEYYTGVYEQQREDD 
VTERGGLI*I^AKAD0MIO31IIjHSLCNQVPShraGQTSPKKGKIV0SLSVPYGGARTK0PT 
HLKLHIPKPLSEMLQSDI*IPPNAGCSSKYKNDM 

TDANGDFPGEYRGPGSLCLAVSPDSPDNDLLAGOFGVPISKPFTTLGEVAPVWVPDSQ 
APNCMKCEARFTFTKRRHHCRACGKVFCAACCSLKCKIiQY 

AQAWENMLS AS VQ S PNPNNP AEYCSTI P PMQQ AQ AS GALS S P P PTVMVP VGVLKHP GT 
EGSQSKEORRVWFADGILPNGETADSDNAinrrTVAGTLTVSHTIJNSTSSESENTSGFC 
GSITQVGSAMNLIPEDGLPPILISTGVKGDYAVEERPSQMSVMQQLEEGGPDPLVFVL 
NANIJjAMVKIVITYTNRKCWCFTTKGMHAVGQAEIVILLQCLPDEKCL 

YQEAIAGNWGNLGHSFLSQSFLGSKDHGGFLYVAPTYQSLQDLVIiPAEPYLFGILIQ 
KWETPWAKVTP IRLMIJtfiGAEYI^YPCPL^^ 

TLPWQGLWDMEVRKTS I KI P SNRYNEMMKAMNKSNEHVLAI G ACFNQMAD S HL VCV 
QNDDGNYQTQAISIHKOPRKVTGASFFVPSGAIjKSSSGYIJUCSSIVEDGVMVQITAES 
MDALRQSLREMKDFTITCGKADAEESQEHVYVQWVE 

SVKIFHGSEYKASGKIIRWIEVFFLDNEEQQSGLSDPADHSRLTENVAKAFCIJ^CPH 

LKLLKEDGMTPJjGLRVSLDSDQVGYQAGSNGQLLPARYTNDLDGALVPVIHGGTCQLS 
EGP VSMEL I FY I LENT S * 
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TABLg 7 - XSARA 2 - Saonenea TP NO: 7 

I A J AA 2SE A ^^ 

J?JS£i™£II CCTCCTCAGATGAAACT ^ 

G ^ A ^SE^?^^^^ GAA ^^^^^ G ^ A ^ AA ^5TCACTGATAAACCAGCTCTTGACAAT 
G J C * G I AAA I? ATCATTGCTA "^ 

C ^ C ^S™5 G I CCACATATGTGAATGGAGAAG TAGGTATTGTGACTCCTGA^ 
CCCTCTGAAATTGTATGCAACTCTATTGTGGAGAGTCAAAGTTTAGAAGTTTTAGATG 

atgtacc^tgagtattaacaatgaaaaaagtgttct^ 

ctacagtagccccaaaagtcttctaaactctgcS^ 

;H^S c SS tcaaaaaa ttgttaatgaccaaS 

A * G ?E G 5^^^^ < ^^^ G cttatag tg cattaaaacaaaccacat c agttatt ctgcct 
g aaataaagccttattcc^tacaggctgccctttcatg^ 

C CAGATGTCAATTAAATAATACAGAT CTACTCAG CAT^CAC^G TGGTTG A^G CATG 
I^ G ^E^ G £ G ^ C ^ C ^ GT ^ CT GCTGCTGAAACTGTACTAGGTAATGAAGCTC 



* AG £I? A J AAAGAAAGA ™ 

T^ G ?i!t GA ^^ GGG ^ G ^ GA ^^ G ^ A ^ GGAA ^ GAAAAA ^^^2CTGTACGTCAAAACCCA 
AA ? A £^ AGCATCAAAAGAA ^^ 

AG JJ GAG ^ A ^ A ^E ^ GTGGTG ^^^^ G AACAACAGAAGGAAGATGACATAACTGAGAG 
AGGTGGAGTCTTGTTAAATGCCAAGSTTGarraanTnaan&a f» rTwrrT! r^™-* 



A A ^'-CATGCATGGGCAAACCTCACCAAAAAAGGGCAAGATTGTGC 

^^»^ GG ^ GA ^^ G ^ GGA ^ A ^ GG ^ GGAGG ^^ G ^ GGGAA ^^^^^^^^^^ATCTCAAACT 
C ^ TA ^ CC ^ CAGCCATTGTCTGAAA 




FS^II^ GAATCACTACGT5AGG ^ G TGCACAGCCCTGTTACTGATACAAATGGTGA 
A ™ C r5 C S AGAAAACAGGGGACCTGGCAG CCTGTGCCTTGCAGTGTCTCCAGACAGC 
^ GA ^J GA I^ G ^^ C -= G -CAGTTTGGGGTACCCATCT2TAAGCCATTTACTA 

G ^ G ^ GG ^ GA: ^ A ^^ ACCAAA A GG AGGCATCACTGCCGAGCTTGTGGAAAG 

f; A I G ^™S AAATGTGGTGT ^ CATCAGGGCAACAG TAATCACGGCAAA 

A ^ CA ^I G I GTT ^ GC ^ GATTCAGT ^AAA G TAGACTTATAAGTTACACAGTAACAAT 

^^™ C F C ^ GCCTCATTOGAAGTAGATAAAATATA ^TATTAGGAAACTCTGGG 

G ^ GA I A I AAGGGAAAGCTTGCC ^^ 

AGA T GA ™^? G ^ AAA ^ 
A iE AAAAAG I^ 

CAAATGCTTAGTGCCTTCACACAAAGTGCCTTTAATAAAAATTATTTTATAAATTATC 

- ~ AG ATTTACATGTTGTACAACAGTTATTG CAATATGTGT 
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TABLE 8 - XSARA2 - Sequence ID NO 1 8 

MPKWVIGDTDMAEDSLFNTGPSEXVCKS IVESQSLEVLDDVPVSINNEKSVLLDDGFS 
P YS S PKS VLNS ACLTMNNGKPSHGQKI VNDQDKEAVTI SVLPMII QDTTNVS TD PAFN 
KSGTEEAYSALKQTTSVILPEIKPYS IQAALS CENINKI PRCQLNNTDLLS I S PWEA 
CSEKQQNHTTSLHEKKIAAVSATAFFPVTAAETVLGN^^ 

FNGDLTRTNGLSQENNEMFYASKELEGGVDANILLEDACIAYKERIDLSEENGTNAPM 
YLYNGCDS YGMKNPAVRQNFK2ILPSKEDSVTEEKEIEESKSEYYSGVYEQQKEDD I TE 
RGGVXiLNAKVDQMTOISLHSLYOTVPSOT 

LNIPQPLSEMLQCDLIFPNAGCSSKNKNDMLNKSITOGDNL^^ 

EVPRENRGPGSLCIAVSPDSPDIIDI^GQFGVPISKPFTTLGDVAPVWVPDSOAPNCM 
KCEARFTFTKRRHHCRACGKVCKEMWCFIRATVITANYS 
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TABLE 9 



hSARA 



ftSARA 
XSARA 



hSARA 
XSARA 



hSARA 
XSARA 



hSARA 
XSARA 



hSARA 
XSARA 



hSARA 
XSARA 



hSARA 
XSARA 



hSARA 
xSARA 



AC I V I LLQCLFPEKCLP WP L F1 5 HF VE LY QE k I IAON VVI GNLQHSF I SOS FIGSICPHGCF1.*YV Sky 



JTwcDsgaaevci aSagT 



1 



a 47$ 

d in 



^ ^ ^ ^ 



rQS LQDLVLP 
rOSLQDLVLP i 



»)E PYLF 



tS3 
MS 



w w> gKWC TFWAKy FP> t R LMLHLtSAEY^VYPCPL F>VWTW K> L FOETOHT4' t fa V L AD F WW Y Q Y T l"P V V Q Q L V VP MR V H KT1 MS 



hSARA BIK»_f SMR Y NEMMK A UN 
XSARA L - — . 



M "I^AMNICSNEHVLAl .PACFIloU^ ^ 



OALKSSSOYI 1025 

H K ***^^" , *"***Q ) f^AE ALB OS LRE M KDFT1 TCQKADAEEgPE^ VrfvbwV €Pol(»li F KKGVi^P I OGKSME SviHslyij 110S 



hSARA 
XSARA 



AB PlMljijjj I PEVFF COM 6 E O a S&lLSPF APHS RLTE WV AKA FCt. AlTcV ML K LLK C PCMTB LG L 



LRV.TLDSDOVG1 



HVSLDSDOVCYl 111S 



hSARA pAGiHG<^P(L^rOYVf7oTDUSTv> KfTMOG 
XSARA QAQS MGOiL L.P A 



W-MOLD|SffcTv^K|l MOGACQUfiEGPvlvklfeCI FY I LEN I k - 1J33 
£Y1Tt<OLOks ULV» » V II MQQTCQLSEQPV s fclE L TfyTlE M I E . 1235 



TABLE 10 



587 eery ptpv 

737 LCOCTJtai 



hSARA 
XSARA 
KIAA0305 
FGD1 
Mrs 

Hrm.2 

EEA-J 1341TOa7Tn 



1S3AAERArbWVD^^EEb^PWVDFVC^ TR * MMCRACaO I FCCkCSS 6* V P * * G i E K E^Sv ci P CY 'o 




P*W- 



•r&*hotacg->{ rc%-c%-. rvc 



- -C- L 
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WE CLAIM: 

1 . An isolated polynucleotide comprising a nucleotide sequence encoding a 
SARA protein or a splice variant thereof. 

5 

2. The isolated polynucleotide of claim 1 , wherein the SARA protein is a 
mammalian SARA protein. 

3. The isolated polynucleotide of claim 1, wherein the SARA protein is a 
10 non-mammalian SARA protein. 

4. The isolated polynucleotide of claim 2, wherein the SARA protein is a 
human SARA protein. 

15 5. The isolated polynucleotide of claim 3, wherein the SARA protein is a 
Xenopus SARA protein. 

6. The isolated polynucleotide of claim 1, wherein the nucleotide sequence 
is selected from the group consisting of 
20 (a) a nucleotide sequence encoding the amino acid sequence of 

Sequence ID NO:2; 

(b) a nucleotide sequence encoding the amino acid sequence of 
Sequence ID NO:4; 

(c) a nucleotide sequence encoding the amino acid sequence of 
25 Sequence ID NO:6; 

(d) a nucleotide sequence encoding the amino acid sequence of 
Sequence ID NO:8; and 

(e) a nucleotide sequence encoding a SARA protein and capable of 
hybridising to a sequence complementary to the nucleotide sequence of 

30 any of (a) to (d) under stringent hybridisation conditions. 



10/05360 PCT/CA99/00656 

64 

7. The isolated polynucleotide of claim 4 comprising the nucleotide 
sequence of Sequence ID NO:1 or a degeneracy equivalent thereof. 

8. The isolated polynucleotide of claim 4 comprising the nucleotide 
sequence of Sequence ID NO:3 or a degeneracy equivalent thereof. 

9. The isolated polynucleotide of claim 3 comprising the nucleotide 
sequence of Sequence ID NO:5 or a degeneracy equivalent thereof. 

1 0. The isolated polynucleotide of claim 3 comprising the nucleotide 
sequence of Sequence ID NO: 7 or a degeneracy equivalent thereof. 

11 . An isolated polynucleotide comprising a nucleotide sequence of at least 
1 0 up to the total number of consecutive nucleotides of a sequence selected 
from the group consisting of Sequence ID NO:1, Sequence ID NO:3, Sequence 
ID NO:5 and Sequence ID NO:7 or a nucleotide sequence complementary to 
any one of said sequences. 

1 2. An isolated polynucleotide comprising a nucleotide sequence encoding at 
least one functional domain of a SARA protein. 

13. The isolated polynucleotide of any one of the preceding claims wherein 
the polynucleotide is a polydeoxyribonucleotide. 

14. The isolated polynucleotide of any one of claims 1 to 1 1 wherein the 
polynucleotide is a polyribonucleotide. 

15. An isolated polynucleotide encoding a SARA protein FYVE domain. 

16. A recombinant vector comprising the isolated polynucleotide of any one 
of claims 1 to 15. 
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1 7. A host cell comprising the recombinant vector of claim 1 6. 

1 8. A process for recombinant ly producing a SARA protein or a fragment 

5 thereof comprising culturing the host cell of claim 1 7 under conditions whereby 
the SARA protein or fragment thereof is expressed and isolating the expressed 
SARA protein or fragment thereof. 

19. A substantially pure SARA protein. 

10 

20. The protein of claim 19 which is a mammalian SARA protein. 

21. The protein of claim 19 which is a non-mammalian SARA protein. 

15 22. The protein of claim 20 which is a human SARA protein. 

23. The protein of claim 22 comprising the amino acid sequence of Sequence 
ID NO:2 or Sequence ID NO:4. 

20 24. The protein of claim 21 comprising the amino acid sequence of Sequence 
ID NO:6 or Sequence ID NO:8. 

25. A SARA protein that is at least 50 percent identical in amino acid 
sequence to the sequence of Sequence ID NO:2 or Sequence ID NO:4. 

25 

26. The protein of claim 25 wherein the SARA protein has a FYVE domain 
having at least 65 percent identity in amino acid sequence to the FYVE domain 
of hSARAI (Sequence ID NO:2) and a C-terminal sequence of 550 consecutive 
amino acids which have at least 50 percent identity to the C-terminal 550 amino 

30 acid residues of hSARA! . 
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27. The protein of claim 25 wherein the SARA protein has an FYVE domain 
having at least 65 percent identity in amino acid sequence to the FYVE domain 
of hSARAI (Sequence ID NO:2) and wherein the portion of the SBD 
corresponding to amino acid residues 721 to 740 of hSARAI has at least 80 
percent identity with that portion of hSARAI. 

28. A substantially pure polypeptide comprising an amino acid sequence of at 
least 4 up to the total number of consecutive amino acids of a sequence selected 
from the group consisting of Sequence ID NO:2, Sequence ID NO:4, Sequence 
ID NO:6 and Sequence ID NO:8. 

29. A substantially pure polypeptide comprising at least one functional 
domain of a SARA protein. 

30. A substantially pure polypeptide selected from the group consisting of 

(a) SASSQSPNPNNPAEYCSTIPPLQQAQASGALSSPPPTVMVPV 
GVLKHPGAEVAQPREQRRVWFADGILPNGEVADAAKLTMNGTSS; and 

(b) amino acids 589 to 672 of the XSARA1 sequence of Table 9. 

31. A substantially pure polypeptide comprising a SARA protein FYVE 
domain. 

32. The polypeptide of claim 3 1 comprising a polypeptide selected from the 
group consisting of 

(a) amino acids 587 to 655 of the hSARAI sequence of Table 9; 

(b) amino acids 510 to 578 of the XSARA1 sequence of Table 9; 

(c) the consensus amino acid sequence of Table 1 0; and 

(d) a functional fragment of a polypeptide of any of (a) to (c). 



33. A substantially pure polypeptide comprising a SARA protein TGFp 
receptor interacting domain. 
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34. The polypeptide of claim 33 selected from the group consisting of 

(a) amino acids 751 to 1323 of the hSARAI sequence of Table 9; and 

(b) a functional fragment of polypeptide of (a). 

5 

35. A substantially pure antibody which selectively binds to an an antigenic 
determinant of a SARA protein. 

36. A cell line producing the antibody of claim 35. 

10 

37. A method for identifying an allelic variant or homologue of a human 
SARA gene comprising 

choosing a nucleic acid probe or primer capable of hybridising to a 
human SARA gene sequence under stringent hybridisation conditions; 
15 mixing the probe or primer with a sample of nucleic acids which may 

contain a nucleic acid corresponding to the homologue variant or homologue; 
and 

detecting hybridisation of the probe or primer to the nucleic acid 
corresponding to the variant or homologue. 

20 

38. A method for modulating signal transduction by a TGF0 superfamily 
member through a SARA protein-dependent pathway, the method comprising 
modulating the binding of the SARA protein with its binding partner. 

25 39. The method of claim 38 comprising a method selected from the group 
consisting of 

(a) modulating the binding of the SARA protein to a Smad binding 
partner; 

(b) modulating the binding of the SARA protein FYVE domain to its 
30 binding partner; and 

(c) modulating the binding of the SARA protein to the TGFp receptor. 
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40. A method for preventing or treating a disorder characterised by an 
abnormality in a TCFp superfamily member signaling pathway which involves a 
SARA protein, the method comprising modulating the binding of the SARA 

5 protein involved in the pathway with its binding partner. 

41 . A method for screening a candidate compound for its potential as a 
modulator of SARA protein-dependent signaling by a TGFp superfamily member 
comprising 

10 (a > determining the ability of the compound to bind to a SARA 

protein; and 

(b) determining the ability of the compound to alter the 
phosphorylation state of a SARA protein. 

15 42. A non-human transgenic animal comprising a polynucleotide encoding a 
heterologous SARA protein or a portion thereof. 

43. The transgenic animal of claim L01 wherein the polynucleotide encodes a 
human SARA protein or a portion thereof. 

20 



44. A non-human animal having a genome from which the SARA gene has 
been deleted. 
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